Quantum Chromodynamics (QCD) is thought to be the correct microscopic theory of the
strong force. Composed only of eight colored gluons and thirty-six colored quarks, its dynamics 
are believed to generate the observed hadronic spectrum of more than a thousand particles 
and account for many of their interactions - from the formation of nuclei to the rate at which 
a ρ-meson decays. Unfortunately, its predictions are often beyond the reach of our current
understanding of quantum field theory. Although the theory is perturbative at high energy, 
the phenomenon of confinement makes direct QCD calculations of physical observables quite 
intractable. 

Through a process called factorization, one systematically separates the soft, incalculable 
physics from the perturbative high-energy physics. The soft physics is then grouped into 
matrix elements which can be measured in experiment. Once fixed, these matrix elements can 
be used to predict the outcome of other experiments. This process allows one to indirectly test 
the validity of QCD in the perturbative regime. Moreover, the matrix elements themselves are 
of interest. Acting as a window to the nonperturbative physics of QCD, these objects allow 
us to link the quark and gluon degrees of freedom of the microscopic theory to macroscopic 
observables such as hadron momentum and spin. 



This thesis presents a study of how these ideas can be used to understand the origins of 
nucleon spin. Beginning with a general overview of QCD as a quantum field theory in Chapter 
1, we then move on to study inclusive Deeply-Inelastic Scattering (DIS) in Chapter 2. Inclusive 
DIS presents one of the earliest examples of a factorizable process. The soft physics relevant 
to this process is related to parton probability distributions, and provided some of the first 
evidence that protons have nontrivial internal structure. We discuss the one-loop corrections 
to this amplitude, the definitions and interpretations of the distribution functions, and a proof 
that the hard and soft physics can be consistently separated at high energy to all orders in the 
QCD coupling. 

The original research presented here concerns Deeply-Virtual Compton Scattering (DVCS),
a process closely related to DIS. The special kinematics of DVCS allow one to measure new 
hadronic parton distributions. As explained in Chapter 3, these distributions contain information
on the origins of nucleon spin that DIS cannot provide. We calculate the one-loop
coefficient functions of these new distributions as they appear in DVCS and show that the soft 
physics inherent to this process can be separated from the hard at this order. Armed with the 
methods of Chapter 2, we then argue that this property persists to all orders in the strong 
coupling. Using our result, we calculate an operator product expansion for the product of two 
electromagnetic currents that is valid for a certain class of off-diagonal matrix elements. This 
constitutes a generalization of the expansion first considered by Wilson in 1969, which has 
been used extensively over the past thirty years. 

Chapter 4 is devoted to the complications associated with transversely polarized DIS. We 
discuss the special circumstances that arise here due to the introduction of a transverse direction.
In particular, the gauge-invariance of QCD demands that these effects be inseparable
from multiparton correlations within the nucleon. Among other things, this fact complicates 
the scale evolution of the relevant distributions. We show that these complications are removed
at the one-loop level by taking the limit of large $N_c$. Although this simplification has
been known for several years, its origins have remained a mystery. Our new approach to the 
calculation allows us to directly understand these origins. We discuss the possibility of similar 
simplifications at the higher loop level and for other important distributions. 
0.1 Introduction



In the beginning, God created the Heavens and the Earth. Ever since, it has been the goal of
physicists to understand the laws they obey. The dream of physics is a single set of laws which
describe all phenomena from the largest galaxies to the smallest atoms and beyond. Recently,
physics has reached a plateau of understanding on which only a few physical
processes cannot be explained quantitatively. The physics of the very large is governed
by Einstein's General Relativity Q. This theory has had tremendous success in accurately 
predicting the orbits of the planets in our solar system, the bending of light by our Sun, and the 
behavior of radiation emitted by distant galaxies. The physics of the very small is thought to 
be described by the Standard Model. First formulated in its present form in 1973,¹ this model
is well-known for its extraordinarily accurate predictions of the properties of many subatomic 
particles, the excitation spectra of atoms, and the existence and theory of radioactive decay. 
Perhaps the most famous example of a precision calculation within the Standard Model is the 
calculation of the anomalous magnetic moment of the electron to 11 decimal places ||. This 
quantity was measured to 7 parts in a trillion [|| as a test of the Standard Model, making it 
the most precisely measured quantity in the history of the human race. Agreement between 
experiment and theory to this degree of precision is unheard of in any other branch of physics, 
making this aspect of the Standard Model the most precisely tested theory in the history of 
physics. 

1 The 'electroweak' sector was formulated in pieces throughout the 1960's. The most notable contributions
are from S. L. Glashow Q], S. Weinberg and A. Salam [gj. The 'strong' sector was not written in its present
form until 1973, as discussed below.

The Standard Model (SM) consists of three parts. These can be thought of as Electricity
and Magnetism, the Weak Interactions, and the Strong Interactions. Electricity and Magnetism
in the SM is a relativistic quantum version of the familiar classical Electrodynamics
embodied by the Maxwell Equations and Lorentz force law. This is the sector of the SM which
is largely responsible for all of the macroscopic objects we see around us: computers, light
bulbs, cars, etc. It is also responsible for almost every branch of science. It is the driving
force behind chemistry, metallurgy, condensed matter physics, biology, etc. Well understood
as a classical theory, its marriage with quantum mechanics led to the first realistic quantum
field theory. Quantum field theories have dramatic implications for the structure of matter in
general, some of which were studied for the first time in the context of quantum electrodynamics
(QED). In particular, QED employs photons not only as the physical quanta of light, but
also as carriers of the electromagnetic force. In QED, charged particles interact by exchanging
massless spin-1 photons. This idea of forces being mediated by vector bosons² plays a fundamental
role in the other two sectors of the SM. Indeed, one can understand many aspects of
these elusive forces simply by exploiting an analogy to QED. 

The Weak Interactions manifest themselves mainly in certain forms of radioactive decay. 
The processes of beta decay, electron capture, and positron emission first studied in the late 
1890's³ all take place via the Weak Interaction. Their discovery and subsequent study provided
the first evidence of the existence of this sector of the SM. However, it was not understood 
that these processes required the introduction of a new force until the discovery of the neutron 
in 1932. Before this momentous discovery, it was believed that beta decay was simply the
emission of one of the 'nuclear electrons' needed to get the observed nuclear charge and mass 
right in the contemporary model of the nucleus. The discovery of the neutron forced scientists 
to reconsider the process that leads to beta decay in the nucleus. More recently, many other 
manifestations of this interaction have been seen experimentally, most notably the decay of 
muons and pions and the existence of the so-called 'strange' particles. However, this force has
so many intricate features and is so well-hidden in ordinary matter that its theory was not completed
for more than twenty years after these discoveries. Its completion at the end of the 1960's
won Glashow ||, Weinberg j|, and Salam Q] the Nobel Prize in Physics in 1979. G. 't Hooft
and M. Veltman's 1972 proof that the theory is internally consistent and complete [10] won a
second Nobel Prize in Physics in 1999. The experimental discovery by C. Rubbia and S. van
der Meer of the W- and Z-bosons [11] proposed to mediate the weak interactions won them
the Nobel Prize in 1984.

2 To highlight the difference between two basic particle types, particles which obey Bose-Einstein statistics
are called bosons and those which obey Fermi-Dirac statistics are called fermions. This behavior finds its
origins in the particle's spin, or intrinsic angular momentum. Particles with integer spin are invariably bosons
and those with half-integer spin are fermions.

3 True matter radiation was first discovered by H. Becquerel M in 1896. Originally discovered in samples of
uranium ore, his 'uranic rays' were not understood as properties of individual atoms until M. Curie. For a
comprehensive history of the discovery of radiation, see [B|.

In this work, we will mainly be concerned with the final sector of the SM. The Strong 
Interactions are responsible for the structure of nuclei and some of their decay modes. Alpha 
decay and the rare decays of spontaneous fission and proton and neutron emission are mediated 
by this force. It was alpha decay that first permitted the discovery of the nucleus itself. In 1909, 
H. Geiger and E. Marsden performed an experiment in which they found that approximately
one in eight thousand alpha particles striking a gold foil is scattered at an angle
greater than 90° [Q. The contemporary model of the atom, due to J. J. Thomson [13],
involved many negatively charged electrons embedded in a jelly-like medium of smeared-out 
positive charge. Since electrons are many thousands of times lighter than alpha particles, this 
model did not provide any mechanism for the observed back-scatter. 

E. Rutherford proposed that atoms are composed of a very heavy positively charged hard 
core, or nucleus, surrounded by Thomson's negatively charged electrons. This core was
estimated to be approximately one hundred thousand times smaller than the atom [14]. He
argued that the electric field of this heavy nucleus was responsible for the back-scattering 
observed in recent experiments, rather than the light electrons or the diffuse jelly. This had 
the unattractive consequence that the vast majority of the volume of an atom is empty space. 
However, later experiments proved the correctness of this model. Since the attraction of
electrons to the nucleus is accomplished by the electromagnetic field, it seemed the only thing 
left to study was the nucleus itself. 

Radioactivity became physicists' 'window on the nucleus' when it was discovered that 
almost all radioactive processes are nuclear in origin. Physicists were beginning to understand 
that the nucleus itself is a strongly interacting bound state. Rutherford's experiments in 
1919 showed that one could obtain oxygen by bombarding nitrogen with alpha particles [15].
Through years of these experiments and intense theoretical work, it became clear that nuclei
are composed of two kinds of heavy particles, collectively called nucleons.

The positively charged protons responsible for the electric charge of nuclei were discovered 
first partly because they exist independently of other nucleons. These objects are the nuclei 
of the lightest isotope of hydrogen and were called 'positive electrons' by Rutherford until 
it became clear that they are really quite different from electrons. Neutrons were the last 
constituent of 'ordinary' matter to be discovered since they do not carry an electric charge and
decay into protons when isolated from nuclei. Long thought to be an extremely tightly bound 
system of a proton and an electron, neutrons were not given separate identities until 1932. In 
that year, J. Chadwick showed that the neutral radiation resulting from the bombardment of 
beryllium with alpha particles was not simply electromagnetic radiation, as widely believed, 
but was in fact a new kind of particle which he called the neutron [16]. He received the Nobel
Prize for this discovery in 1935. The neutron solved many of the paradoxes the new quantum 
theory brought to nuclear physics. At last, the nucleus was seen as a sensible quantum system. 
All that was needed for a complete theory was an interaction responsible for the formation 
and stability of the bound states observed. 

Once it was understood that the nucleus is made of positively charged protons and neutral 
neutrons, it became apparent that the electromagnetic repulsion inherent in nuclei must be 
overcome by a tremendously strong force. Called the Strong Force for obvious reasons, this new 
interaction was proposed to involve only protons and neutrons. Neutron, proton, and alpha⁴
scattering data on various nuclei gave scientists of the time a good amount of information on
the excitation spectra, but contemporary models were mainly empirical in origin. A modest
degree of success was obtained by these early models, most notably the composite nucleus
model of N. Bohr [17] and the celebrated nuclear shell model [18], but there was much left to
be desired.

4 α-particles were long known to be $^4$He nuclei.

A partial solution to the problem of the origins of the nuclear force came with the discovery
of a new triplet of particles and the exploitation of an approximate symmetry seen in
nuclear interactions. It was proposed by H. Yukawa in 1934 [19] that if this new force is indeed
fundamental, a description based on the newly formulated theory of relativistic quantum
electrodynamics may lead to new insights. He argued that since the long-range nature of the
electromagnetic field is attributed to the masslessness of the photon, a short-range nuclear
force may be generated by massive particle exchange. Obtaining the range of the strong force
from scattering experiments, he found that the mass of this new particle, called a meson,
should be about two hundred times the mass of the electron.

In reality, there are five particles in Yukawa's predicted mass range. Only three of them
have the right properties to be candidates for the meson required in his theory. The other two
appear as decay products of these three,⁵ and have nothing whatsoever to do with the strong
nuclear force.⁶ This created a great deal of confusion in the scientific community for about
ten years, but in 1947 C. F. Powell and collaborators were finally able to show conclusive 
evidence testifying to the existence of two charged strongly interacting bosons with masses 
approximately 270 times the electron mass [20]. For his prediction of their existence, Yukawa
received the Nobel Prize in 1949. A third meson in this mass range was discovered in 1948, 
this one with no electric charge. The existence of this triplet forced physicists to take another 
look at the structure of the strong interactions. 
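
Yukawa's range argument can be made quantitative with the rough relation $m c^2 \approx \hbar c / r$ between the mass of the exchanged particle and the range $r$ of the force. The short Python sketch below is illustrative only: the two ranges of 2.0 fm and 1.4 fm are assumed values chosen for the example, not numbers taken from the scattering data mentioned above, and the function name is hypothetical.

```python
# Illustrative estimate of the Yukawa meson mass from the range of the force.
# Assumption: m c^2 ~ hbar*c / r, with ranges picked for illustration only.

HBAR_C_MEV_FM = 197.327        # hbar*c in MeV fm
ELECTRON_MASS_MEV = 0.510999   # electron rest energy in MeV


def yukawa_mass_in_electron_masses(range_fm: float) -> float:
    """Return the exchanged-particle mass in units of the electron mass."""
    mass_mev = HBAR_C_MEV_FM / range_fm
    return mass_mev / ELECTRON_MASS_MEV


if __name__ == "__main__":
    for r in (2.0, 1.4):  # assumed ranges in fm
        ratio = yukawa_mass_in_electron_masses(r)
        print(f"range {r:.1f} fm -> about {ratio:.0f} electron masses")
```

Under these assumptions, a range of about 2 fm reproduces an estimate of roughly two hundred electron masses, while a range near 1.4 fm gives a value close to the 270 electron masses quoted for the charged mesons.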

For many years, it had been known that to a very good approximation the strong
force does not distinguish between protons and neutrons. In fact, if we take Yukawa's meson 
exchange force seriously, exchanges of charged mesons actually change neutrons into protons 
and vice versa. This new symmetry led physicists to consider protons and neutrons as different 
manifestations of the same particle. In these models, protons and neutrons are considered as 
components of a two-dimensional vector in a new space. This internal space is called 'isospin 
space' since it is mathematically identical to ordinary quantum-mechanical spin. The mesons 
are considered a vector in isospin space and work to rotate the nucleon states. This symmetry 
is actually a very good approximation to the true form of strong interaction physics, and has
been useful in classifying particles and relating certain strong processes to each other. It also
became instrumental in revealing to us the true nature of the strong interaction. 

5 Well, two of them...

6 These objects, the μ±, are actually heavier copies of electrons. They provide the first evidence of the
second generation of fermions.

Along with the 1950's came a new generation of accelerators in which one could liberate
larger and larger amounts of energy, which nature used to form exotic new particles. At this
point, it was expected that all of the fundamental particles were known and these higher
energies would only reveal more detail about the structure of the strong interactions and the
behavior of nuclear excited states. However, hundreds of new strongly interacting particles
had been discovered by 1960. These new objects were classified according to their charge, spin,
decay properties, and isospin in an effort to make some sense of this 'zoo' of particles. The
situation here is analogous to Mendeleev's arrangement of elements on his periodic table. Just
as his arrangement led to a greater understanding of chemistry and eventually the atom, this 
arrangement of particles led to the creation of the SM. In order to accommodate all these new 
particles, the word 'meson' was extended to mean 'integer spin strongly interacting particle' 
and the mesons of Yukawa's theory were called π-mesons, or pions. The word 'baryon' was
introduced for strongly interacting fermions and 'hadron' as a blanket term for all strongly
interacting particles.⁷

One of the most striking patterns in the particle zoo emerges when we arrange them 
according to the length of time it takes for them to decay. This exercise reveals three distinct 
time scales, showing physicists clear evidence of three different forces. The shortest decay times 
are on the order of $10^{-21}$ to $10^{-24}$ seconds, and involve only strongly interacting particles.
These are attributed to the strong force. The mid-range lies on the order of $10^{-17}$ to $10^{-19}$
seconds and is associated with the electromagnetic interaction. These decays are signaled by
the appearance of photons in the final state. The longest lived particles have half lives on the
order of $10^{-10}$ to $10^{-8}$ seconds.⁸ These decays are attributed to the weak interaction. Weak
decays are signaled by the existence of leptons (like electrons and muons) in the final state, as 
well as missing energy attributed to the elusive neutrinos postulated to conserve energy and 
momentum in the weak decay of the neutron. 

7 Nuclei are not included in these designations. Although the $^{14}$N nucleus is indeed a boson which participates
in the strong interaction, it is never referred to as a meson. The terms 'meson' and 'baryon' will be defined
more clearly below.

8 Two notable exceptions are the neutron and the μ±, both of which have uncharacteristically long lifetimes
due to phase space suppression.

Many of the observed decays fit nicely into this scheme and physicists were given confidence
in this three force model. However, a small number of mesons and baryons were found to
behave very strangely. These hadrons participate in decays that look very much like strong
decays, but they live far too long, and certain symmetries seen to a very high degree of
accuracy in all true strong interactions were badly broken by these decays. For example, the
heavy K-mesons, with a mass of about a thousand electron masses, were found to have the competing
decay modes

$$
\begin{aligned}
K^+ &\to \mu^+ + \nu_\mu \\
    &\to \pi^+ + \pi^0 .
\end{aligned}
\tag{1}
$$

The first mode is unambiguously a weak mode signaled by the neutrino. The second appears
to be a strong mode since all of the objects participating are strongly interacting particles. If 
this is the case, it should dominate the decay and cause the half-life of the $K^+$ to be on the
order of the characteristic time scale of the strong interactions. In reality, the half-life of the
$K^+$ is known to be $\sim 10^{-8}$ s, characteristic of a weak decay.

A closer inspection of the second decay mode reveals that it does not respect the isospin
symmetry found to such a good approximation in the strong interactions. This is easy to see
since the K-mesons (kaons) have isospin-1/2 while pions have isospin-1. The final state can
only have isospin-2, while the initial state has isospin-1/2. All of these strange new decays
were found to violate isospin. Physicists of the time had to concede that we were seeing a
new manifestation of the weak interaction. It was postulated that these particles possess a
new quantum number (strangeness) which must be conserved by the strong interactions, but
may be changed by one unit via the weak interaction. Mesons could be found only with
strangeness 0 and ±1, but there were baryons with strangeness ±2 and ±3 as well. A baryon
with strangeness $S = 2$ has to decay first to a baryon with $S = 1$, then to a baryon with $S = 0$.

Since the strong interaction is postulated to conserve strangeness and (approximately) 
isospin, classification of its states in terms of these quantum numbers can give us insight into 
its true form. This observation led physicists to plot the particles of a given spin on axes 
labeled by these two quantum numbers.⁹ Figure 0.1 shows this arrangement for the lowest-lying
spin-0, -1/2, -1, and -3/2 hadrons. The mass-splitting within these multiplets is small
compared with the energy required to jump to the next group of hadrons with the same spin, 
so to a first approximation these objects can be considered degenerate. This is much like 
the approximation of isospin symmetry which proved so fruitful in the past. In fact, isospin 
multiplets (particles lying along a horizontal line in Fig. 0.1) are very close to degenerate. 
One can think of each jump away from the $S = 0$ axis as a shift of about 350 electron masses,
while the splitting along the horizontal axis is closer to 20 $m_e$.¹⁰ If the analogy with isospin
were exact, we would assign the spin-1/2 baryons to the fundamental representation of some
internal 8-dimensional space. However, this procedure would not allow us to understand the
reason why isospin is a much better symmetry than this new group. Ideally we would like
to place these particles in representations of some symmetry group which not only allows us
to account for all our new low-lying states in a natural way,¹¹ but also contains our isospin
symmetry as a subgroup. This means that we can break the full symmetry group in such a
way that isospin remains unbroken.

9 There are in fact two quantum numbers associated with isospin, as shown in the Appendix. In this plot, the
isospin Casimir $I^2$ has been suppressed and only the 3-component of isospin, $I_3$, is considered.

[Figure: four panels (a), (b), (c), (d); horizontal axis $I_3$]

Figure 1: Gell-Mann's Eightfold Way for (a) spin-0 and (b) spin-1 mesons. These mesons form
the 8 of $3 \times \bar{3}$. If SU(3) were exact, each multiplet would be degenerate and would not mix
with the 1 of $3 \times \bar{3}$. However, since in reality SU(3) is badly broken, the mathematical ideals
$\eta_8$ and $\omega_8$ mix with the singlets $\eta_0$ and $\omega_0$ to form the physical states $\eta$, $\eta'$, $\omega$, and $\phi$. (c) and
(d) show the baryon spin-1/2 octet and spin-3/2 decuplet, respectively. In each octet there
are two particles in the center: one with isospin-0 and one with isospin-1. The fact that they
lie on top of each other is due exclusively to the fact that I have chosen to suppress $I^2$.



Our ideal was achieved in 1962 by M. Gell-Mann's Eightfold Way [21]. Gell-Mann found
that the observed hadron spectrum could be placed into representations of the special unitary
group in three dimensions, SU(3). Many of the mathematical properties of this group are
discussed in Appendix C. Here, it is important only to know that there are two inequivalent
fundamental representations of SU(3) from which one can build all other representations.
Calling these two representations 3 and $\bar{3}$, we find that the mesons of spin-0 and -1 form
representations of $3 \times \bar{3}$ and the baryons of spin-1/2 and -3/2 form representations of $3 \times 3 \times 3$.
If we postulate the existence of objects in the fundamental representations of this SU(3), as
M. Gell-Mann and G. Zweig did in 1964 [22], say quarks for 3 and antiquarks for $\bar{3}$, the last
sentence says that mesons are mathematically equivalent to bound states of one quark and
one antiquark and baryons are mathematically equivalent to bound states of three quarks. At
this point, the idea of quarks and antiquarks is merely a mathematical construct. In order to 
suggest the actual existence of these particles we would have to propose a full theory of them 
explaining, in particular, why we haven't seen them in the lab. 
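
The representation content quoted above can be checked by simple dimension counting. The sketch below uses the standard dimension formula for an SU(3) representation with Dynkin labels $(p, q)$, namely $(p+1)(q+1)(p+q+2)/2$, together with the standard decompositions $3 \times \bar{3} = 8 + 1$ and $3 \times 3 \times 3 = 10 + 8 + 8 + 1$. The function name and the labelling by $(p, q)$ are conventions assumed here for illustration; they are not notation used elsewhere in this text.

```python
# Dimension counting for the SU(3) products quoted in the text.
# Assumption: representations are labelled by Dynkin labels (p, q),
# a convention introduced here for illustration only.

def su3_dim(p: int, q: int) -> int:
    """Dimension of the SU(3) representation with Dynkin labels (p, q)."""
    return (p + 1) * (q + 1) * (p + q + 2) // 2


TRIPLET = su3_dim(1, 0)       # the 3 (quarks)
ANTITRIPLET = su3_dim(0, 1)   # the 3-bar (antiquarks)
SINGLET = su3_dim(0, 0)       # the 1
OCTET = su3_dim(1, 1)         # the 8
DECUPLET = su3_dim(3, 0)      # the 10

# Mesons: 3 x 3bar = 8 + 1
meson_total = TRIPLET * ANTITRIPLET
meson_pieces = OCTET + SINGLET

# Baryons: 3 x 3 x 3 = 10 + 8 + 8 + 1
baryon_total = TRIPLET ** 3
baryon_pieces = DECUPLET + 2 * OCTET + SINGLET

if __name__ == "__main__":
    print("mesons: ", meson_total, "=", meson_pieces)
    print("baryons:", baryon_total, "=", baryon_pieces)
```

Matching dimensions is only a consistency check, not a derivation of the decomposition, but it shows how the meson octets and the baryon octet and decuplet fit inside the quark products.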

10 In contrast, the jump to the next multiplet of spin-1/2 baryons is approximately 1000 $m_e$.

11 This is not easy in a group with an 8-dimensional fundamental representation since the spin-3/2 baryons
form a 10-dimensional representation.

Even if they don't actually exist in nature, quarks provide a nice way to organize the known
particles. If we name the quarks in the fundamental representation 'up', 'down', and 'strange'
(u, d, and s), we find that the new strangeness quantum number is simply the number of strange
quarks in the hadron (with a minus sign from convention). Thus a baryon with $S = -2$ is
composed of two strange quarks and one other quark. If we suppose that the strange quark
is heavier than the other two, breaking the SU(3) invariance of the theory, the multiplets
break down exactly into the isospin multiplets we observe. The residual isospin symmetry 
is nothing but the remaining symmetry between up and down quarks. The electromagnetic 
charge assignments of all known particles can be predicted from the quark content if d and s 
quarks are given a charge of $-1/3$ and u quarks are given $+2/3$, in units of the proton charge.
Conservation of strangeness is easily understood in this model if we postulate that the strong 
and electromagnetic interactions cannot change the type of quark, but the weak interaction 
can. In fact, almost all of these strangeness changing weak decays can be understood in terms 
of the basic weak decay of an s quark into a u quark and a negatively charged state (like a
$\pi^-$). Other weak decays (like that of the neutron) are understood as a change from d to u or
vice versa. This idea completely changed the shape of the weak interactions in the years to 
come. Spin assignments can be understood by postulating that quarks (and antiquarks) have 
spin-1/2: Since baryons are representations of $3 \times 3 \times 3$, they may have spin-1/2 or -3/2.¹²
Mesons will have integer spin since they are representations of $3 \times \bar{3}$.¹³ Hence we see that
if quarks actually exist as particles, many of our problems will be solved. Much in the way 
that the discovery of protons, neutrons, and electrons reduced the number of fundamental 
particles from the eighty or ninety different atoms plus photons and other radiation to four, 
the discovery of quarks would reduce the hundreds of strongly interacting particles to three 
fundamental objects. 
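
The charge and strangeness bookkeeping described above is easy to verify by hand or by machine. The following Python sketch adds up the quark charges stated in the text (+2/3 for u, -1/3 for d and s, in units of the proton charge) and counts strange quarks with the sign convention given above. The quark contents used in the examples (uud for the proton, udd for the neutron, sss for the Ω, and u with an anti-s for the $K^+$) are the standard assignments and are supplied here as assumptions; the function names are hypothetical.

```python
# Charge and strangeness of hadrons from their quark content.
# Quark charges are those stated in the text; the example quark contents
# are the standard assignments, supplied here as assumptions.
from fractions import Fraction

QUARK_CHARGE = {
    "u": Fraction(2, 3),
    "d": Fraction(-1, 3),
    "s": Fraction(-1, 3),
}


def hadron_charge(quarks: str, antiquarks: str = "") -> Fraction:
    """Total charge in units of the proton charge; antiquarks carry opposite charge."""
    total = sum((QUARK_CHARGE[q] for q in quarks), Fraction(0))
    total -= sum((QUARK_CHARGE[q] for q in antiquarks), Fraction(0))
    return total


def strangeness(quarks: str, antiquarks: str = "") -> int:
    """Strangeness: -1 for each strange quark, +1 for each strange antiquark."""
    return antiquarks.count("s") - quarks.count("s")


if __name__ == "__main__":
    examples = {
        "proton": ("uud", ""),
        "neutron": ("udd", ""),
        "Omega": ("sss", ""),
        "K+": ("u", "s"),
    }
    for name, (q, qbar) in examples.items():
        print(name, "charge", hadron_charge(q, qbar), "S", strangeness(q, qbar))
```

Using exact fractions rather than floating-point numbers makes it explicit that the fractional quark charges always combine to integer hadron charges in these examples.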

However nice Gell-Mann and Zweig's theory of quarks looks at first sight, it has some major 
problems. One problem is that no mechanism is given for binding the quarks into hadrons. 
Many models were proposed in analogy with QED to solve this problem, but no acceptable 
solution could be found. The new force would have to be extremely strong to account for the 
fact that hadrons appeared to be fundamental to the experimentalists of the time and no free 
quarks had been seen in the lab. In fact, no particle of fractional charge had ever been seen 
in any experiment anywhere. The fact that it was necessary to give quarks fractional charges 
to account for the observed hadron charges was greatly ridiculed at the time. 

12 Higher spins can also be obtained in this model by adding orbital contributions; these are always integer, 
so do not change the fact that the state will be a fermion. 

13 Herein lies the true meaning of the terms baryon and meson. A baryon is a three-quark bound state while 
a meson is a bound state of a quark and an antiquark. 
Even more devastating is the problem of quantum statistics. Suppose, for example, we wish
to construct the wavefunctions of the lowest-lying spin-3/2 baryons assuming the existence of 
quarks. Since we consider ground states, it is natural to assume that our wavefunctions are 
not orbitally excited (i.e. the orbital angular momentum of our states is zero). The only way 
to obtain spin-3/2 from three quarks without the help of orbital angular momentum is to align 
all of their spins, making this part of the wavefunction fully symmetric under quark exchange. 
Fermi-Dirac statistics then demands antisymmetry of the final SU(3) sector. However, the 10 
of SU(3) is a fully symmetric representation. Hence the observed spectrum seems to require
us to abandon the Pauli exclusion principle¹⁴ [23] and be satisfied with a quantum state which
is symmetric under the exchange of identical fermions.¹⁵ In the non-relativistic theory, this
principle had been postulated independently of the rest of quantum mechanics and could be argued
away as something that is not valid in this arena. However, it had been shown by Dirac that 
a relativistic theory in which fermions do not obey the Pauli principle does not make sense. It 
was this argument that seemed like the devastating blow to the quark model. Quarks could not 
exist as real particles because they did not obey the basic requirements of a sensible quantum 
theory. 

14 Alternatively, we could concede that the spin-3/2 baryons are indeed orbitally excited. However, we would
then be forced to explain why we see no lower-energy SU(3) singlet state which is not orbitally excited.

15 Even without taking the limit of exact SU(3) symmetry, this problem exists. Consider the Ω baryon with
its spin aligned along the z-axis. Experimentally, the Ω is seen to have $S = -3$; in the quark model, it is
viewed as a bound state of three s quarks. In the absence of orbital excitations, all of our quarks must have
their spins aligned along the z-axis to account for the observed spin-3/2 state. Hence we have three identical
fermions in the same quantum state - an explicit violation of Pauli's exclusion principle without reference to
SU(3).

In reality, this blow was not as devastating as it seemed. In 1964 it was argued by O. W.
Greenberg, M. Y. Han and Y. Nambu [24] that an extension of the simple quark model could
solve this problem. If one endowed quarks with one other quantum direction in which they
could be antisymmetrized, a sensible quantum theory could be constructed. There would have
to be at least three directions in this new internal space since each quark in a baryon has to be
associated with a different direction. Furthermore, since this new quantum number had not
manifested itself in experiment, all observable objects would have to have zero charge. This
requires the number of internal directions to be exactly three, since otherwise three quarks
could not form a singlet. Since the quarks must always be antisymmetrized in this internal 
direction, the rest of their wavefunction will be symmetric, exactly as observed. Therefore, 
at the expense of introducing a new quantum number, we have disarmed the most devastating 
argument against the esthetically pleasing quark model. Of course, this postulate seems 
ludicrous at first glance. If the esthetics of the quark model were not so great, this theory 
may never have been advanced. However, the tremendous beauty of the quark model makes 
us want to interpret it physically rather than concede that it is nothing but mathematics. 
However, we cannot simply say that the quantum number is invisible and go home. It must 
have observable consequences. Furthermore, our theory still lacks the mechanism required to 
bind quarks inside hadrons. 
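
The counting in this argument can be written out explicitly. The following is a minimal sketch in notation that is assumed here and does not appear in the text: $q^a$ denotes a quark carrying the new internal index $a = 1, 2, 3$, and $\epsilon_{abc}$ is the totally antisymmetric tensor.

```latex
% Sketch (assumed notation): a three-quark state with no net internal charge
|B\rangle \;\propto\; \epsilon_{abc}\, |q^{a} q^{b} q^{c}\rangle ,
\qquad \epsilon_{abc} = -\epsilon_{bac} = -\epsilon_{acb} .

% Under a unitary rotation q^a \to U^{a}{}_{a'} q^{a'} of the internal directions,
\epsilon_{abc}\, U^{a}{}_{a'}\, U^{b}{}_{b'}\, U^{c}{}_{c'}
  \;=\; (\det U)\, \epsilon_{a'b'c'} ,
% so the combination is a singlet whenever \det U = 1.
```

A totally antisymmetric invariant built from exactly three quarks requires three internal directions, and since this factor changes sign under the exchange of any two quarks, Fermi statistics forces the remaining spin, flavor and orbital wavefunction to be symmetric.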

These problems persisted for many years since no believable candidate for a theory of 
dynamical quarks appeared. At the time, people were still trying to make sense of quantum 
field theory itself, so there wasn't even a believable playing field on which to make a model of 
the strong interactions. In his book on current algebra, Gell-Mann suggested that quantum 
field theory be used only as a means to, "...construct a mathematical theory of the strongly 
interacting particles, which may or may not have anything to do with reality, find suitable 
algebraic relations that hold in that model, postulate their validity, and then throw away the 
model." [25]. This was the attitude of the time. Most of the activity in the field in the late 
1960's was in the area of current algebra. Current algebra is a method advanced by Gell-Mann 
which allows one to derive experimentally verifiable relationships between observables from 
the symmetries underlying a theory without using the full formalism of quantum field theory. 
While the agreement between theory and experiment during this time was impressive, it was 
only Gell-Mann's SU(3) that was being tested. 

Real progress in the strong interactions continued in 1969 when E. D. Bloom et al. measured 
high energy electron-proton deep-inelastic scattering at SLAC for the first time [26]. Their data 
showed definitively that the proton behaved at high energy as though it were made of pointlike 
constituents. This prompted R. P. Feynman to advance his parton picture of deep-inelastic 
scattering (DIS) [27], in which he assumes that hadrons are made of pointlike partons.$^{16}$ This 
model has the advantage of describing DIS without assuming much about the nature of partons 
or their interactions. Further experiments at SLAC showed that the partons probed in DIS 
have spin-1/2. It seemed that a revival of the quark model was inevitable. 

16 He is careful not to use 'quarks' in his model. 

Another result from the SLAC data had an even more profound effect on the future of the 
strong interactions. At large angles, certain functions which parameterize the scattering were 
found to scale with energy. Rather than depending separately on the energy and momentum 
transfer to the proton, they depend only on a dimensionless ratio involving these two quantities. 
This behavior is predicted in the current algebra of Gell-Mann, as shown by C. G. Callan, D. 
J. Gross and J. D. Bjorken [28], and in the operator product expansion techniques of Wilson 
[29], which will be discussed in detail in Section 2.5. However, the details of renormalization in 
quantum field theories usually destroy this property. Intending to use the experimental results 
to finally put these nonsensical theories to rest, Gross attempted to prove that there exist no 
interacting quantum field theories which admit scaling. In a paper with Callan, he showed that 
in order for a field theory to scale it must have a property known as 'asymptotic freedom', that 
is, it must behave like a free theory in the region of interest (the asymptotically large energy 
region in which scaling is predicted) [30]. Shortly thereafter, Gross and S. Coleman were able 
to show that the only kind of theory known which could be asymptotically free is a non-abelian 
gauge theory [31]. In order to complete his proof that quantum field theory cannot describe the 
strong interactions, Gross and his student F. Wilczek tackled the task of calculating directly 
whether or not these theories could admit asymptotic freedom. Much to their surprise, the 
calculation revealed that non-abelian gauge theories can indeed be asymptotically free [32].$^{17}$ 
This result suggests that the symmetry associated with the quantum number introduced 
by Greenberg, Han and Nambu be used as the local symmetry required for a non-abelian 
quantum field theory of quark interactions. A physical motivation for the introduction of this 
new quantum number, called color because there are three primary colors, had been found in 
the total rate of $e^+e^- \to$ hadrons, which was experimentally seen to be enhanced by a factor 
of three over the expectation from only three quarks. The rate of the electromagnetic decay 
$\pi^0 \to 2\gamma$ is similarly enhanced. This marks the first appearance of the theory of the strong 
interactions known today as quantum chromodynamics (QCD), or the quantum dynamics of 
the color field. 

17 H. D. Politzer also published this result simultaneously. 
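
The factor of three quoted for $e^+e^- \to$ hadrons follows from simple charge counting. As a sketch, assuming lowest-order production of free quark pairs, energies below the charm threshold, and the conventional normalization to $\mu^+\mu^-$ production (the ratio $R$, the quark charges $Q_f$ and the number of colors $N_c$ are notation introduced here):

```latex
R \;\equiv\; \frac{\sigma(e^+e^- \to \text{hadrons})}{\sigma(e^+e^- \to \mu^+\mu^-)}
  \;\simeq\; N_c \sum_{f=u,d,s} Q_f^2
  \;=\; N_c \left( \frac{4}{9} + \frac{1}{9} + \frac{1}{9} \right)
  \;=\; \frac{2}{3}\, N_c .
% Without color (N_c = 1): R \simeq 2/3.   With three colors (N_c = 3): R \simeq 2.
```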

At this time, it is advantageous for us to leave the historical treatment of the strong 
interactions and focus on QCD. Modern QCD views hadrons as highly relativistic bound 
states of nearly massless quarks and massless spin-1 objects known as gluons.$^{18}$ The strong 
interactions between hadrons are viewed as residual interactions, much like the molecular forces 
between neutral atoms. Obviously, this aspect of the modern theory of strong interactions 
complicates matters greatly - especially since we still do not fully understand the way it works 
in QED. Quarks interact by exchanging gluons in exactly the same way that electrons and 
protons interact by exchanging photons. However, in QCD gluons also exchange gluons. This 
new property comes from the fact that QCD is a non-abelian theory as opposed to QED, an 
abelian theory. This will be explained in detail in Chapter 1. For now, we explore some of the 
qualitative aspects of QCD. 

The most important consequence of the non-abelian nature of QCD is the phenomenon of 
asymptotic freedom which led to its discovery. This property causes QCD to become a weakly 
interacting theory at short distances. Looking at this the other way, we see that QCD becomes 
strongly interacting at long distances. Since we don't know how to solve a strongly interacting 
quantum field theory we cannot work out what this implies quantitatively. However, we can 
qualitatively say that if the interaction becomes stronger and stronger as color charges are 
separated, we can never completely liberate a color charge. Since a consistent quantum field 
theory has no choice but to allow the vacuum to create pairs of particles and antiparticles 
from ambient energy, the energy associated with separation of color charges will at some point 
become great enough to permit the creation of a pair of opposite color charges from the vacuum. 
At this point, the charges we were separating have happily combined with the new vacuum 
charges to form color singlets, and we find ourselves with two hadrons where we once had only 
one. This property of QCD is called color confinement: the color is physically confined in 
singlet arrangements. Of course, once the coupling gets large enough, the calculations required 
to show asymptotic freedom are no longer valid. It is for this reason that we cannot prove 
the existence of color confinement in QCD and it remains one of the most poorly understood 
aspects of the strong interactions, and, more generally, the entire SM. 
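
The statement that the coupling weakens at short distances can be made semi-quantitative. The expression below is the standard one-loop form of the running coupling, quoted here only as an illustration; the symbols $\alpha_s$, $Q$, $\Lambda$, $\beta_0$ and $n_f$ are not defined in the passage above, and asymptotic freedom itself is taken up in Section 1.4.

```latex
\alpha_s(Q^2) \;=\; \frac{4\pi}{\beta_0 \,\ln\!\left( Q^2/\Lambda^2 \right)} ,
\qquad
\beta_0 \;=\; 11 - \frac{2}{3}\, n_f .
% \beta_0 > 0 for n_f \le 16 flavors, so \alpha_s \to 0 as Q \to \infty (asymptotic freedom).
% As Q approaches \Lambda from above the expression grows without bound; this signals the
% breakdown of perturbation theory, not a proof of confinement.
```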

18 These are the analogue of QED's photons. 

Quark    Constituent Mass ($m_e$)    Current Mass ($m_e$)
u        550-710                     3-10
d        640-710                     6-18
s        1000-1050                   120-330

Table 1: Constituent versus current quark masses for the three lightest quarks. 

One of the biggest mysteries of contemporary QCD is the approximate validity of naive 
quark models. Several phenomenological models of quarks were employed in the 1960's to get 
the observed hadron spectrum. If one assumes that a proton is made of two u quarks and a d 
quark, a $\Lambda$ of a u, a d, and an s, a $K^+$ of a u and an $\bar{s}$, et cetera, one can make naive predictions 
of the masses of the hadrons in terms of the quark masses. A spin-spin interaction proportional 
to the inverse masses of the quarks can account for the observed splitting between the spin-1/2 
and spin-3/2 baryons and give us even more experimental data with which to compare. Once 
we have fit the few parameters of our phenomenological theory to data, we can predict the 
rest of the observed mass spectrum. These predictions are rather accurate, to the credit of 
our naive models. The fits also determine effective masses for the quarks.$^{19}$ In addition, one 
can obtain the masses of quarks in QCD from the current algebra approach mentioned above. 
These two methods give very different results, as shown in Table 1. 
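
A fit of this kind can be sketched as follows. The specific form is the one commonly used in naive quark models and is an assumption here, as are the symbols $m_i$ (effective quark masses), $\vec{S}_i$ (quark spins) and the fitted constant $A$.

```latex
M_{\text{baryon}} \;\simeq\; \sum_{i=1}^{3} m_i
  \;+\; A \sum_{i<j} \frac{\vec{S}_i \cdot \vec{S}_j}{m_i\, m_j} .

% For three quarks of equal mass m, use
%   \sum_{i<j} \vec{S}_i\cdot\vec{S}_j = \tfrac{1}{2}\left[ S(S+1) - 3\cdot\tfrac{3}{4} \right] :
%   S = 3/2 :  +3/4 ,      S = 1/2 :  -3/4 ,
% so the spin-3/2 and spin-1/2 states are split by
M_{3/2} - M_{1/2} \;\simeq\; \frac{3A}{2\, m^2} .
```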

Another red flag seen in this approach is that it has completely ignored gluons. Gluons 
would certainly be expected to contribute to the energy of these highly relativistic bound 
states. Furthermore, the wavefunctions used to calculate the splitting between hadrons of 
different spin have assumed that the spin of hadrons comes only from quarks. However, since 
gluons are vector particles they are expected to contribute to the spin of the hadrons as well. 
In fact, asymptotically one expects approximately 50% of the spin of the proton to be carried 
by gluons. These models have also ignored the expected contribution from the so-called 'sea' 
quarks. It is well-known that relativistic quantum theories allow the creation of 
pairs of particles and antiparticles from the vacuum. It would certainly be naive of us to 
imagine that the highly relativistic strongly bound hadronic states consist only of three quarks 
or a quark and an antiquark. The simple fact that these systems are so strongly bound gives 
the system quite enough ambient energy to create many quark-antiquark pairs out of the 
vacuum. Such vacuum fluctuations are called the 'sea of quarks' in the hadron. Any realistic 
hadronic model must take this sea into account as well. 

19 The best fits require one to use different quark masses for mesons and baryons. The values obtained from 
meson data are always smaller than those from baryons, and the fits are somewhat better in the baryon sector 
(~ 1%) than in the meson (as bad as ~ 10%). 

After all of this analysis, we are faced with the question of why these models work so well. 
It seems that somehow the interactions of QCD arrange the quarks, antiquarks, and gluons in 
such a way that a description of hadrons in terms of naive valence quarks is appropriate for 
certain quantities. These effective degrees of freedom are commonly called constituent quarks. 
The standard belief is that while these constituent quarks have the effective masses listed in 
the first column of Table 1, the true quarks of QCD have the much smaller 'current' masses 
in the second column. Of course, due to the effect of confinement, the masses of quarks and 
gluons are not well-defined within quantum field theory. Strictly speaking, the mass of an 
object is defined only as that object travels asymptotically far away from any other object. 
Since colored objects cannot do this, the 'mass' of a colored object has no meaning in this 
sense.$^{20}$ 

One of the major difficulties with QCD is the fact that its degrees of freedom are not the 
same as those we see in the laboratory. This is an immediate consequence of confinement. If we 
would like to understand any strong interaction scattering process, we must first understand the 
complicated bound states involved in the process. This is an extremely difficult path to take; 
we still do not fully understand the scattering of complicated atoms. Nuclear bound states are 
even more difficult. Using QCD to understand even simple nuclei is indescribably complicated. 
For this reason, physicists have worked to find a theory which is expressed in terms of the low 
energy degrees of freedom. Since such a theory has composite objects as its fundamental 
fields, it cannot be expected to make sense for arbitrarily large momenta. However, if the 
momenta involved are small, the effects of the internal structure of the fundamental fields can 
be absorbed into unknown constants and fit to the data. Such theories are required to have the 
same symmetries as the underlying theory, but are otherwise unconstrained. This naturally 
leads to a tremendously complicated theory whose lagrangian has infinitely many terms. The 
hope is that there is some small parameter in the theory which allows one to calculate only to 
finite order. 

20 This is actually a property of any unbroken non-abelian gauge symmetry, whether or not it is asymptotically 
free. 

There are several theories which have achieved a limited degree of success in the past, 
all of which are derived from chiral symmetry. Chiral symmetry is an isospin-like symmetry 
possessed by QCD in the limit of small quark mass. An effective theory based on this symmetry 
ends up being an expansion in the ratio of external momentum or the pion mass to a scale 
which turns out to be near the proton mass. This means the expansion parameter in chiral 
perturbation theory is about 1/7. One interesting thing about Chiral Perturbation Theory 
($\chi$PT) is that it restores the pions to their original place as carriers of the strong interaction. 
In $\chi$PT, one sees neutrons and protons exchanging pions in a way equivalent to that originally 
envisioned by Yukawa. However, this theory is much more complicated. The infinitely many 
terms in its lagrangian make power counting - the systematic truncation of the lagrangian - 
the most important game in town. In all expansions, one hopes that the next (uncalculated) 
coefficient is not unnaturally large. Even with a very small expansion parameter, this can lead 
to trouble. Unfortunately, many quantities which have been calculated in $\chi$PT are plagued 
with large coefficients. This fact of life has made low energy strong interaction physics very 
difficult. One wonders whether or not a sensible effective theory exists for this domain of 
nuclear physics. 
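
The size of the expansion parameter can be checked with rounded masses. The values used below (a pion mass of about 140 MeV and a proton mass of about 938 MeV) are standard numbers supplied here, not quoted in the text.

```latex
\frac{m_\pi}{m_p} \;\approx\; \frac{140\ \text{MeV}}{938\ \text{MeV}} \;\approx\; 0.15 \;\approx\; \frac{1}{7} ,
\qquad
\left( \frac{m_\pi}{m_p} \right)^{2} \approx 0.02 , \quad
\left( \frac{m_\pi}{m_p} \right)^{3} \approx 0.003 .
% A coefficient of order 10 at the next order is enough to offset one power of the expansion parameter.
```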

Fortunately, the high energy side of QCD does not have these difficulties. Asymptotic 
freedom assures us that at high energy the QCD coupling is small and we may use perturbation 
theory as we do in QED. On the other hand, confinement tells us that we cannot escape 
the low energy physics we do not understand. Any physical process involving the strong 
interaction must necessarily also involve complicated bound states since they are the only 
physically acceptable external states of the theory. For example, DIS can be considered as the 
collision of a virtual photon and a proton. If we consider low energy scattering, the proton 
collectively absorbs the photon and recoils. At very high energy, the photon does not see the 
proton as a whole. Rather, it resolves the bound state into its constituents and is absorbed 
by one of the quarks. To properly calculate the amplitude for this process, we would have to 
consider high energy scattering on an arbitrary parton in the proton and then average over the 
existing species of parton. High energy scattering on arbitrary partons can be done in QCD 
perturbation theory. However, the averaging process requires knowledge of the elusive proton 
wavefunction. Although we cannot calculate it, this object appears in many experiments. Since 
a proton is a proton regardless of what experiment we decide to subject it to, this information 
could in principle be used to predict the outcome of other experiments. More importantly, 
if we can use the perturbative calculation of the scattering to extract this information from 
experiment, we will have quantities which relate the physical degrees of freedom to those of our 
microscopic theory. This somewhat roundabout way of using perturbation theory to extract 
hadronic structural information from experiment is the topic of this dissertation. 
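
Schematically, the procedure just described amounts to a formula of the following type. This is only a sketch of the parton-model form: the symbols $f_a(x)$ (the probability of finding a parton of species $a$ carrying a fraction $x$ of the proton momentum $P$) and $\hat{\sigma}_a$ (the perturbative cross section on that parton) are notation assumed here, and the precise statement is developed in the chapter on DIS.

```latex
\sigma\!\left( \gamma^{*} p \to X \right)
  \;\simeq\; \sum_{a} \int_{0}^{1} dx \; f_a(x)\;
  \hat{\sigma}_a\!\left( \gamma^{*} + a(xP) \to X' \right) .
% \hat{\sigma}_a : high energy scattering on a single parton, calculable in perturbation theory.
% f_a(x)         : soft proton structure, not calculable but the same in every experiment.
```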

In Chapter 1, I give a somewhat detailed introduction to the basics of perturbative quantum 
field theory as applied to QCD. This chapter is meant to provide a mathematical foundation 
for the rest of the thesis, as well as familiarize the reader with some of my conventions. 

Chapter 2 is devoted to the aforementioned process of DIS. First considered in the late 
1960's, this process provides the perfect first step for a look at factorization theorems, parton 
distribution functions and operator product expansions. It is here that the theoretical 
groundwork for my original research on deeply virtual Compton scattering (DVCS) is laid. 

Chapter 3 encompasses the process of DVCS. Introduced by X. Ji because of the structural 
information it probes [34], DVCS can be used in conjunction with polarized DIS to extract the 
contribution to proton spin due to quark orbital angular momentum. This quantity cannot be 
isolated by any other experiment proposed in the literature. Using the treatment of Chapter 
2 as a model, we calculate the one-loop QCD corrections to DVCS at the leading twist level 
and use the results to generalize Wilson's operator product expansion of two electromagnetic 
currents. More importantly, we show that this process is factorizable at this level to all orders 
in QCD perturbation theory. The material presented in this chapter was first published by X. 
Ji and me in [35]. 

Chapter 4 concerns the one-loop evolution of the twist-3 structure function $g_T(x)$ measured 
in DIS with transverse polarization. After a brief review of some of the complications that 
arise in transverse processes, we discuss the problem of its horrendous anomalous dimension 
matrix. The fact that this distribution mixes freely with the more general two-parton 
correlators $G_1(x,y)$ and $G_2(x,y)$ causes immense complications in the analysis of experimental 
data. These other distributions are known to decouple in the limit of large $N_c$ at the one-loop 
level [36], leaving only the simple evolution experienced by twist-2 distributions. This result 
is studied from a new standpoint in which the reason for the simplification becomes obvious. 
Our technique is extended to predict the outcome of the two-loop large-$N_c$ result for twist-3 
distributions and the one-loop twist-4. Unfortunately, this analysis reveals contributions which 
will surely destroy the simplification in these sectors. The material presented in this chapter 
is based on a paper published last year by X. Ji and me in [37]. 

Chapter 1 

The Basics of QCD 



In the introduction, I have outlined QCD qualitatively as a renormalizable quantum field 
theory describing the interactions of spin-1/2 quarks and spin-1 gluons. The form of this 
theory is governed by a gauge principle based on a local SU(3) symmetry. This chapter is 
dedicated to explaining quantitatively what that means. 

1.1 The Lagrangian 

To build the lagrangian of QCD here, I will employ certain naturalness arguments which I 
believe to be a nice way to look at the SM. These arguments are certainly not intended as 
proofs that this is the only way the theory of strong interactions can be formulated. The 
validity of the theory is placed solely in the hands of experiment. However, it is interesting to 
note that one can build QCD in this way given only a knowledge of quantum field theory and 
an exact SU(3) symmetry for some Dirac fields. 

We begin with a theory of free spin-1/2 quarks. Assuming $n_f$ different types (flavors) of 
colored quarks, we write our free lagrangian density as 

    $\mathcal{L} = \sum_{f=1}^{n_f} \bar{\psi}_f \left( i\not\partial - m_f \right) \psi_f$ .    (1.1)

$\psi_f$ denotes the color triplet of Dirac fields corresponding to quarks of flavor $f$. We note 
that quarks of different flavor are uncoupled. Our implicit assumption that color is entirely 
an internal symmetry is reflected in the fact that the quark mass does not depend on the 
color. The fact that this symmetry is internal means that there is no difference between quark 
colors and our theory should be invariant under arbitrary reparametrizations of color. Since 
reparametrizations must not change the normalization of our fields, we restrict transformations 
to $3 \times 3$ unitary rotations, i.e. $\psi_f \to U\psi_f$ with $U$ an arbitrary unitary $3 \times 3$ matrix. If the SM 
consisted only of this sector, these transformations could be made separately for each flavor. 
However, the weak sector breaks this larger symmetry group by coupling quarks of different 
flavors. In some sense, the weak sector forces us to choose the same color convention for all 
flavors; the remaining symmetry is concerned with which convention was chosen. 

The group of unitary $3 \times 3$ matrices is called U(3). Certain elements of U(3) do not 
distinguish between different colors; they merely give an overall phase to the entire triplet. Such 
transformations are not associated with color reparametrization invariance,$^{1}$ so we exclude 
them from our group. The remaining objects form the group of special unitary $3 \times 3$ matrices, 
SU(3). Many of the mathematical properties of SU(3) are discussed in the appendix. At 
present we need only the form of a general$^{2}$ SU(3) matrix, 

    $U = e^{i\theta^a t^a}$ .    (1.2)

The $t^a$, $a = 1$ to 8, are the SU(3) generators introduced in the appendix. For now, their explicit 
form is unimportant. We need only know that they are hermitian and traceless. The $\theta^a$ are 
arbitrary real numbers which we will take to be small for most of our discussion. 
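
The two properties of the generators quoted here are exactly what is needed for $U$ to lie in SU(3). A short check, using only hermiticity, tracelessness and the identity $\det e^{M} = e^{\mathrm{Tr}\, M}$:

```latex
U = e^{i\theta^a t^a}
\;\;\Rightarrow\;\;
U^\dagger = e^{-i\theta^a (t^a)^\dagger} = e^{-i\theta^a t^a} = U^{-1}
\qquad (\text{$t^a$ hermitian, $\theta^a$ real}) ,

\det U = e^{\,i\theta^a\, \mathrm{Tr}\, t^a} = 1
\qquad (\text{$t^a$ traceless}) .

% For small \theta^a the transformation is close to the identity:
U \simeq 1 + i\theta^a t^a + \mathcal{O}(\theta^2) .
```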

When we rotate our quark fields, we do so in the same way at every point in spacetime. 
From the standpoint of special relativity, this seems like a strange procedure. We don't transmit 
any information during this process, so it does not violate causality. However, it still is 
not a natural thing to do. Ideally, we would like to define conventions separately at each point 
in spacetime so that the physics we describe is local. This means we wish our lagrangian to 
be invariant under arbitrary spacetime dependent SU(3) transformations. Such local 
transformations are called gauge transformations. 

1 These are precisely the transformations which lead one to electromagnetic interactions. 

2 The colorblind SU(3) transformations $e^{2i\pi/3}$ and $e^{4i\pi/3}$ cannot be written in this form. However, since 
these transformations are indeed colorblind, there are rotations in the other sectors which will compensate for 
them. For this reason, we may completely ignore these troublesome phases with the rest of the electroweak 
sector of the SM. 

Since a derivative compares field values at two infinitesimally close, but separate, spacetime 
points without taking into account a difference in convention, our present lagrangian does not 
admit this local symmetry. However, if we decide that we really want this symmetry to be 
local, we can alter our theory slightly to admit it. The reason why the derivative term is not 
invariant under the local symmetry is that $\partial\psi$ does not transform in the same way as $\psi$. There 
is an extra additive term here which reflects the spacetime change in the transformation itself. 
If we add an extra term to the derivative to cancel this change in the transformation, our 
theory will be invariant. With this in mind, we introduce a covariant derivative $\mathcal{D} = \partial + igA$. 

Hermiticity of the lagrangian$^{3}$ requires $gA$ hermitian. Since we want $\mathcal{D}\psi$ to transform in 
the same way as $\psi$ itself, we require $\mathcal{D} \to U\mathcal{D}U^\dagger$. This implies that 

    $A \to UAU^\dagger - \frac{i}{g}\, U \partial U^\dagger$ .    (1.3)
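
The transformation law (1.3) follows in a few lines from the requirement that the covariant derivative of the quark field transform like the quark field itself. The sketch below suppresses spacetime indices and writes $A'$ for the transformed field; both are shorthand adopted here.

```latex
% The plain derivative picks up an extra additive term:
\partial (U\psi) = U\, \partial\psi + (\partial U)\, \psi .

% Demand (\partial + igA')(U\psi) = U (\partial + igA)\psi :
U\,\partial\psi + (\partial U)\,\psi + ig A' U \psi = U\,\partial\psi + ig\, U A\, \psi
\;\;\Rightarrow\;\;
A' = U A U^\dagger + \frac{i}{g}\, (\partial U)\, U^\dagger .

% Since \partial (U U^\dagger) = 0, (\partial U) U^\dagger = - U \partial U^\dagger, so
A' = U A U^\dagger - \frac{i}{g}\, U \partial U^\dagger .

% Infinitesimal form, with U \simeq 1 + i\theta and \theta \equiv \theta^a t^a:
A' \simeq A + i[\theta, A] - \frac{1}{g}\, \partial\theta .
```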

Any matrix A with a nonzero trace can be written as the sum of a traceless matrix and a 
multiple of the identity. As mentioned above, the identity is colorblind and contributes only 
to the electroweak sector of the SM. For this reason, we consider $A$ traceless.$^{4}$ All traceless 
hermitian $3 \times 3$ matrices can be written as a linear combination of the eight SU(3) generators, 
so $A(x)$ contains eight hermitian vector fields - one for each generator of SU(3). Objects which 
are related to the generators of a group in this way are said to be in the adjoint representation 
of the group. 

We have promoted our SU(3) color symmetry to a local symmetry at the cost of introducing 
new vector fields. Since these fields transform under the symmetry, it is natural to consider 
them as dynamical fields in our theory. In order to do this, we must endow them with the ability 
to propagate. In other words, we must give them a kinetic term. It is immediately obvious that 
the standard kinetic term, $\frac{1}{2}\partial_\mu A_\nu \partial^\mu A^\nu$, is not invariant under the local SU(3) symmetry 
we have worked so hard to install. The nonlinear transformation (1.3) makes it difficult to 
construct an invariant kinetic term for $A$. However, since $\mathcal{D}$ contains $A$ and transforms linearly 
under the action of the gauge group, we can use it to form a covariant object which involves 
derivatives of $A$: 

    $\mathcal{F}^{\mu\nu} = -\frac{i}{g}\, [\mathcal{D}^\mu, \mathcal{D}^\nu]$ .    (1.4)

The scalar $\mathcal{F}^{\mu\nu}\mathcal{F}_{\mu\nu}$ contains the appropriate terms for a vector boson kinetic energy. Its trace 
is also invariant under the gauge symmetry. Actually, this term and the quark piece above are 
the only gauge-invariant terms of dimension four or smaller$^{5}$ involving only these fields.$^{6}$ In 
particular, there can be no term that corresponds to a mass for the $A$ fields. 

3 Up to a total derivative. 

4 This property is not altered by the gauge transformation (1.3), as can be seen by inspection. 

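Collecting these pieces and normalizing the kinetic term of our vector bosons, we write the 
full QCD lagrangian density as 

    $\mathcal{L} = \sum_{f=1}^{n_f} \bar{\psi}_f \left( i\not\mathcal{D} - m_f \right) \psi_f - \frac{1}{2}\, \mathrm{Tr}\, \mathcal{F}^{\mu\nu}\mathcal{F}_{\mu\nu}$ .    (1.5)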

This expression looks nearly identical to that of QED. In fact, the only difference is in the 
gauge group. Let us calculate $\mathcal{F}$: 

    $\mathcal{F}^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu + ig[A^\mu, A^\nu]$ .    (1.6)

The first two terms are familiar from QED, while the last is not. Since we can express the 
gauge field A as a linear combination of generators of the gauge group, the last term expresses 
the commutator of two generators of SU(3). For abelian groups, where all of the generators 
commute, this term is not present. This is trivially the case in QED, where there is only one 
generator. In non-abelian theories, like QCD, the inclusion of this term is unavoidable. We 
will see that it leads to self-interactions among gluons, the quanta of the $A$ fields. It is these 
interactions which cause the theory to be asymptotically free, as shown in Section 1.4. 
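
Equation (1.6) can be verified directly from the commutator of covariant derivatives. In the sketch below the commutator acts on an arbitrary test field $\psi$; the component fields $A^\mu_a$ and the structure constants $f^{abc}$, normalized by $[t^a, t^b] = i f^{abc} t^c$, are assumed notation.

```latex
[\mathcal{D}^\mu, \mathcal{D}^\nu]\,\psi
  = [\partial^\mu + igA^\mu ,\; \partial^\nu + igA^\nu]\,\psi
  = ig\left( \partial^\mu A^\nu - \partial^\nu A^\mu \right)\psi
    + (ig)^2 [A^\mu, A^\nu]\,\psi ,

-\frac{i}{g}\,[\mathcal{D}^\mu, \mathcal{D}^\nu]
  = \partial^\mu A^\nu - \partial^\nu A^\mu + ig\,[A^\mu, A^\nu] .

% In components, A^\mu = A^\mu_a t^a :
\mathcal{F}^{\mu\nu}_a
  = \partial^\mu A^\nu_a - \partial^\nu A^\mu_a - g f^{abc} A^\mu_b A^\nu_c .
% The last term is quadratic in A, so the kinetic term contains cubic and quartic gluon couplings.
% It vanishes when f^{abc} = 0, as in QED.
```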



1.2 The Road to Quantization 

Now that we have a lagrangian, we would like to quantize the theory. In scalar field theory, this 
is implemented merely by imposing commutation relations on the fields and their conjugate 
momenta. One introduces creation and annihilation operators$^{7}$ and proceeds with a straightforward 
evaluation of matrix elements. In QED, it is not quite that simple. Gauge symmetry 
requires the introduction of unphysical degrees of freedom. This is obvious because a gauge 
transformation is by definition an unphysical thing. It corresponds merely to a redefinition of 
convention. For each configuration of the photon field, there exist infinitely many physically 
identical configurations which are obtained by transforming the original. Taking all of these 
into account separately is highly redundant and creates divergences which must be removed 
before quantization can make sense.$^{8}$ In order to make sense of a gauge theory, one must 
first choose a gauge so that every physically distinct field configuration is taken into account 
exactly once. The way this gauge is chosen is arbitrary, but certain choices make the resulting 
formalism more complicated than others. 

5 This is required for renormalizability, as will be mentioned in Chapter 4. 

6 There is one other term which is thought to contribute to physics beyond perturbation theory. Its inclusion 
violates the discrete spacetime symmetries of parity and time-reversal, and its coefficient is experimentally seen 
to be extremely small. 

7 See Appendix D.1. 

To fix a gauge, one chooses a function of the gauge field and sets it equal to a certain 
function of spacetime. Normally, people choose a linear function to simplify the manipulations 
necessary to fix the gauge. A standard example is the axial gauge. This gauge choice is made 
by requiring $n \cdot A(x) = \omega(x)$ for some function $\omega$ and some fixed vector $n$. If $n^*$ satisfies 
$n^* \cdot n = 1$, then 

    (1.7)

is an abelian gauge transformation which takes an arbitrary field configuration into one that 
satisfies the gauge condition. Hence this condition can be satisfied by all physically distinct 
configurations.$^{9}$ However, this gauge choice does not completely specify the gauge. If $v$ satisfies 
$v \cdot n = 0$, the gauge transformation 

    $A(x) \to A(x) + \partial f(v \cdot x)$    (1.10)

does not change $n \cdot A$. This is called a residual gauge transformation. If one wishes to completely 
specify the gauge, one must enforce a further gauge restriction to specify the residual gauge. 
This is not usually necessary. In many cases, the ambiguities associated with residual gauge 
transformations are not enough to interfere with the quantization process. With the choice 
of axial gauge, the residual degrees of freedom exist in a special corner of momentum space. 
We will see that our gauge-fixed theory regards this corner as a singular region of momentum 
space, and special care must be taken near it. 

8 This is purely an artifact of our mathematical incompetence. If we were smarter, we would not have to do 
such violence to gauge theories in order to make sense of them. 

9 In a non-abelian gauge theory, the required transformation is somewhat more involved. An explicit solution 
is 

    $U = (P e^{\,\ldots}\,)$    (1.8)

where $(\bar{P})\,P$ denotes (anti-)path ordering of the exponential. This ordering is constructed so that 

    $n \cdot \partial U = ig\,(U\, n \cdot A - \omega U)$ .    (1.9)
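
For the abelian case, a transformation with the properties described above can be written down explicitly. The construction below is one possible choice and is not necessarily the form of Eq. (1.7); the gauge function $\Lambda$, the integration variable $\lambda$, the lower limit of integration and the shorthand $x_\perp$ are all introduced here for illustration.

```latex
% Abelian gauge transformation: A^\mu \to A^\mu + \partial^\mu \Lambda .
% Choose, with x_\perp \equiv x - (n^* \cdot x)\, n  (so that n^* \cdot x_\perp = 0 when n^* \cdot n = 1),
\Lambda(x) = \int_{0}^{\,n^* \cdot x} d\lambda \;
  \big[\, \omega - n \cdot A \,\big]\!\left( x_\perp + \lambda n \right) .

% Shifting x \to x + \epsilon n leaves x_\perp fixed and raises the upper limit by \epsilon, so
n \cdot \partial \Lambda(x) = \omega(x) - n \cdot A(x)
\;\;\Rightarrow\;\;
n \cdot \left( A + \partial \Lambda \right) = \omega .

% Residual freedom: for v \cdot n = 0,
n \cdot \partial f(v \cdot x) = (n \cdot v)\, f'(v \cdot x) = 0 .
```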

In the axial gauge, QCD and QED are very similar, but the problems associated with 
its residual gauge transformations can be quite severe. The axial gauge also breaks Lorentz 
invariance explicitly by introducing the special direction $n$ during the quantization process. 
For these reasons, another gauge-fixing function is often employed. The 'covariant gauges' are 
obtained by specifying $\partial \cdot A(x) = \omega(x)$. Once again, this gauge can be shown to accommodate 
all physical configurations since the abelian transformation 

    (1.11)

takes any configuration into one which satisfies our gauge condition.$^{10}$ This gauge choice also 
contains an ambiguity since any transformation of the form 

    $A^\mu(x) \to A^\mu(x) + \partial^\mu \chi(x)$    (1.12)

with $\chi(x)$ harmonic$^{11}$ leaves $\partial \cdot A$ invariant. Immediately, one can see that the residual 
transformations associated with this gauge choice occupy a much smaller region of momentum 
space than those of the axial gauge. In the previous case, one has an infinite three-dimensional 
subspace of momentum to contend with. Here, only the light-cone itself is a singular region.$^{12}$ 
This region is the three-dimensional surface $(q^0)^2 = |\vec{q}|^2$, which is compact for each value of 
$|q^0|$. Furthermore, the divergences arising from this ambiguity in axial gauge require special 
treatment. Those appearing in covariant gauges can be treated along with other divergences 
which appear naturally in quantum field theory. We will see this in detail in the following 
sections. 

10 Once again, this transformation is more involved for non-abelian groups. In this case, the explicit 
transformation is too complicated to present here. 

11 I.e. $\partial^2 \chi(x) = 0$. 

12 The statement $\partial \cdot \partial \chi = 0$ reads $q^2 = 0$ in momentum space. 

Let us now proceed with the quantization process. To illustrate the similarities between 
QED and QCD, we will not specify the gauge group immediately. In my view, the best way 
to see the process of gauge fixing lies in the path integral approach.^13 In this approach to
quantum field theory, vacuum matrix elements of an operator are understood as weighted
averages of the operator over all possible field configurations. The weight given to a particular
configuration is the exponential of $i$ times its action, $S[\Phi] = \int d^4x\, \mathcal{L}$:

$$\langle T\mathcal{O}\rangle = \frac{\int \mathcal{D}[\Phi]\; \mathcal{O}\; e^{iS[\Phi]}}{\int \mathcal{D}[\Phi]\; e^{iS[\Phi]}} , \qquad (1.13)$$

where $\Phi$ stands for all fields in the theory and $T\mathcal{O}$ is any $n$-point correlation function of any of
the fields $\Phi$, appropriately time-ordered.^14 The path integral takes all field configurations into
account, which means that it is redundantly integrating over a continuous infinity of physically
identical gauge configurations. The idea behind gauge fixing is to make this redundant
integration explicit so it will cancel in the ratios appearing in physical matrix elements. Let
us study only the denominator of Eq. (1.13). Our derivation will follow for the numerator as
long as $\mathcal{O}$ is gauge-invariant. Parameterizing a general gauge transformation by real functions
$\theta^a(x)$ as in Eq. (1.2), we write the identity

$$1 = \int \mathcal{D}\theta^a\; \delta\!\left(\mathcal{G}^a[A^\theta] - \omega^a\right) \det\left(\frac{\delta \mathcal{G}^a[A^\theta]}{\delta\theta^b}\right) . \qquad (1.15)$$

This equation says that there exists a unique^15 gauge transformation parameterized by
$\theta^a(x)$ which will transform a given arbitrary field configuration $A$ into a new configuration
$A^\theta$ which satisfies the gauge conditions $\mathcal{G}^a[A^\theta] = \omega^a$.^16 The determinant is the Jacobian of
the change of variables between $\mathcal{G}$ and $\theta$. Inserting this into the denominator of Eq. (1.13)

13 For a derivation of the path integral and a discussion of its technical points and physical meaning, see Ref. 
[ [Isf . Essentially, the path integral is an instruction to sum the contributions of every field configuration. This 
is done by integrating over the value of a field $\Phi(x)$ at every spacetime point $x$.

14 One defines the time ordering operator, $T$, such that

$$T\{\Phi(x_1)\Phi(x_2)\} = \Phi(x_1)\Phi(x_2)\,\Theta(x_1^0 - x_2^0) \pm \Phi(x_2)\Phi(x_1)\,\Theta(x_2^0 - x_1^0) . \qquad (1.14)$$

The $+$ refers to bosons while the $-$ refers to fermions.

15 I note here that neither of the gauge conditions specified above satisfies this property, as exemplified by
Eqs. (1.10, 1.12). What we really need is for this expression to be independent of $A$ and $\omega^a$. Both the axial
gauge and the covariant gauge satisfy this requirement. The actual number of such gauge transformations will
cancel in the calculation of any physical matrix element as long as it is the same for all configurations.

16 Note that we have one gauge condition for each generator. 
and performing the inverse gauge transformation parameterized by $-\theta^a$ on the integration
variables,^17 we obtain

$$\int \mathcal{D}\theta^a\, \mathcal{D}[\Phi]\; \det\left(\frac{\delta\mathcal{G}^a[A^\theta]}{\delta\theta^b}\right) \delta\!\left(\mathcal{G}^a[A] - \omega^a\right) e^{iS[\Phi]} . \qquad (1.16)$$

It is easily seen by inspection that the dependence of the determinant on $\theta$ is entirely illusory,
so the integration over $\theta$ factorizes completely and cancels in the ratio (1.13). This factor
directly represents the redundancy that led to the need for gauge fixing in the first place.
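The factorization of the redundant integration can be checked in a finite-dimensional toy model. The sketch below is an illustration under stated assumptions, not part of the derivation: the path integral is replaced by a two-dimensional integral of a rotationally invariant weight, rotations by an angle theta play the role of the gauge group, and the gauge is fixed by the condition y = 0, whose Jacobian on the gauge slice is |x|. The helper names and the Gaussian weight are hypothetical choices. Two rotations (theta and theta + pi) satisfy the condition for every configuration, so the number of solutions is a configuration-independent constant that can be divided out.

```python
import math

def gauge_fixed_integral(weight, x_max=8.0, n=200_001):
    """Integrate over the gauge slice y = 0, including the Jacobian |dG/dtheta| = |x|."""
    h = 2.0 * x_max / (n - 1)
    total = 0.0
    for i in range(n):
        x = -x_max + i * h
        total += abs(x) * weight(x * x) * h
    return total

def full_integral(weight, r_max=8.0, n=200_001):
    """Direct evaluation of the two-dimensional integral of weight(r^2) in polar coordinates."""
    h = r_max / (n - 1)
    total = 0.0
    for i in range(n):
        r = i * h
        total += 2.0 * math.pi * r * weight(r * r) * h
    return total

def weight(r_squared):
    # Hypothetical rotationally invariant "action" weight.
    return math.exp(-r_squared)

group_volume = 2.0 * math.pi   # integral over the rotation angle theta
copies = 2                     # theta and theta + pi both satisfy y = 0
z_fixed = group_volume * gauge_fixed_integral(weight) / copies
z_full = full_integral(weight)
```

Both `z_fixed` and `z_full` approximate the same number: the group volume multiplies a gauge-fixed integral that carries the Jacobian, which is the finite-dimensional analogue of the statement that the integration over the gauge parameter factorizes.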

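Since the $\omega^a(x)$ were introduced as arbitrary functions, we may consider a weighted average
over all possible $\omega^a$:

$$\frac{1}{\int \mathcal{D}\omega^a f[\omega^a]} \int \mathcal{D}\omega^a f[\omega^a] \int \mathcal{D}[\Phi]\; \det\left(\frac{\delta\mathcal{G}^a[A^\theta]}{\delta\theta^b}\right) \delta\!\left(\mathcal{G}^a[A] - \omega^a\right) e^{iS[\Phi]} . \qquad (1.17)$$

The integration over $\omega$ cancels independently of the form of the functional $f[\omega^a(x)]$ if we use
the fact that Eq. (1.16) does not actually depend on $\omega^a$. On the other hand, we may consider
the denominator an unimportant (though infinite) constant and use the $\delta$-function to do the
$\omega^a$ integral in the numerator:

$$\frac{1}{\int \mathcal{D}\omega^a f[\omega^a]} \int \mathcal{D}[\Phi]\; \det\left(\frac{\delta\mathcal{G}^a[A^\theta]}{\delta\theta^b}\right) f\!\left[\mathcal{G}^a[A]\right] e^{iS[\Phi]} . \qquad (1.18)$$

Note that although the constraint is no longer explicit, it is obvious from our construction that
in some sense $\mathcal{G}^a[A]$ does not depend on $A$.^18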

All that is left is the calculation of the determinant. Since this depends nontrivially on 
the form of the gauge condition and the specific theory under consideration, we cannot go any 
further without specification. 

Since QED is less complicated, we will consider it first. In an axial gauge,

$$\mathcal{G}[A^\theta] = n\cdot A + \frac{1}{g}\, n\cdot\partial\theta . \qquad (1.19)$$

Hence we are asked to evaluate the determinant of $(1/g)\, n\cdot\partial$. The meaning of this object is not
obvious. If we really wanted to evaluate it, we would have to consider the actual definition of 

17 Since this transformation is unitary on the fermionic fields and the gauge fields (plus a trivial constant 
shift), the measure of the path integral is invariant. If it weren't, the path integral itself wouldn't be gauge 
invariant. 

18 This statement is true in the sense that, once all physically indistinguishable field configurations are taken 
into account, the contribution from each physically distinct set of configurations is identical. 
the path integral. Such things are considered in |p8| . Here, we do not care about this infinite
constant. Since it is independent of all the relevant fields, it will cancel in the calculation of 
physical quantities (much like every other constant we have ignored in this derivation) . Hence 
in this case the determinant may be completely ignored. 

In a covariant gauge, the Jacobian, $\det\left((1/g)\,\partial\cdot\partial\right)$, is also constant. This leaves the function
$f[\omega]$ as the sole remnant of our gauge-fixing in QED. Since we must live with it, we intend to
use our freedom to choose the form of $f[\omega]$ to the best of our advantage. This is accomplished
by choosing a function which will add a quadratic term to the lagrangian. Any other choice
would lead to new vertices for us to take into account in perturbation theory.^19 The function
$f[\omega] = \exp\left[-i\int d^4x\, \omega(x)^2/2\xi\right]$ suits our purpose.

The only effect of all this work is the simple shift in the lagrangian density

$$\mathcal{L}^{\rm axial}_{\rm QED} = \mathcal{L}^0_{\rm QED} - \frac{(n\cdot A)^2}{2\xi} \qquad (1.20)$$

for the axial gauge and

$$\mathcal{L}^{\rm cov}_{\rm QED} = \mathcal{L}^0_{\rm QED} - \frac{(\partial\cdot A)^2}{2\xi} \qquad (1.21)$$

for the covariant gauge. There is one degree of freedom left from the choice of weighting
function - the number $\xi$. In principle, this object could be a function of spacetime since
the weighting was completely arbitrary. However, again the limitations of our mathematical
abilities force us to be less general than the derivation requires. As it is, we take $\xi$ as an
arbitrary constant. This degree of freedom is useful as a check of lengthy calculations since its
dependence must cancel in all gauge-invariant quantities.

We turn now to QCD. The similarities between QCD and QED are more apparent in the 
axial gauge, so we begin with this choice. Our first task is to rigorously define what is meant 
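by the Jacobian under consideration. The matrix $\delta\mathcal{G}/\delta\theta$ is unambiguously given by

$$\mathcal{G}^a[A^{\theta+\delta\theta}](x) - \mathcal{G}^a[A^{\theta}](x) = \int d^4y\, \left(\delta\mathcal{G}/\delta\theta\right)^{ab}(x,y)\, \delta\theta^b(y) . \qquad (1.22)$$

As mentioned above, our Jacobian does not actually depend on $\theta$. Hence we may take $\theta = 0$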

19 Actually, any other choice would lead to disaster in the sense that one still could not define a propagator. 
In this way, the choice of $f[\omega]$ is made for us by the way we know how to do quantum field theory. Of course,
in principle the choice is arbitrary. However, we must know how to live with whatever choice we make. 
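to simplify our task. For infinitesimal $\delta\theta$, Eq. (1.3) reduces to

$$A \to A - \frac{1}{g}\left[\mathcal{D}, \delta\theta\right] . \qquad (1.23)$$

The second term in this expression is the covariant derivative of $\delta\theta$:

$$\left[\mathcal{D}, \theta\right] = t^a \mathcal{D}^{ab}\theta^b ; \qquad \mathcal{D}^{ab} \equiv \partial\,\delta^{ab} + g f^{abc} A^c , \qquad (1.24)$$

where $f^{abc}$ are the structure constants of $SU(3)$ defined in Appendix C. Our matrix is given
by

$$-\frac{1}{g}\, n\cdot\mathcal{D}^{ab}\,\delta\theta^b(x) = \int d^4y\, \left(\delta\mathcal{G}/\delta\theta\right)^{ab}(x,y)\, \delta\theta^b(y) , \qquad (1.25)$$

so

$$\left(\delta\mathcal{G}/\delta\theta\right)^{ab}(x,y) = -\frac{1}{g}\, n\cdot\mathcal{D}^{ab}\,\delta^{(4)}(x-y) . \qquad (1.26)$$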

This matrix appears to depend on $A$ through the covariant derivative. However, the dependence
of the matrix on $A$ is of the form $n\cdot A$. Since we have fixed this component of $A$, it
no longer depends on the physical configuration under consideration. This allows us to ignore 
the determinant once again, making the quantization process in axial gauge identical for QED 
and QCD. The only difference between the two theories appears in the gluonic self-coupling 
terms in the kinetic energy. 

The situation is not so simple in covariant gauge. The derivation of the matrix goes through 
as before. We find 

$$\left(\delta\mathcal{G}/\delta\theta\right)^{ab}(x,y) = -\frac{1}{g}\, \partial\cdot\mathcal{D}^{ab}\,\delta^{(4)}(x-y) . \qquad (1.27)$$

This time, the dependence on $A$ is not illusory. Since $\partial$ is a differential operator that will act
on the $\delta$-function as well as the gauge field, the determinant of this matrix will contribute to
the dynamics of our gauge-fixed theory. We must figure out some way to take it into account 
in our calculations if we want to use this gauge. 

By direct analogy to $N$-dimensional integrals, one can perform Gaussian path integrals
exactly. For any nonsingular hermitian matrix $\mathcal{M}$, we have

$$\int \mathcal{D}\phi^\dagger\, \mathcal{D}\phi\; \exp\left[i\int d^4x\, d^4y\; \phi^\dagger_i(x)\,\mathcal{M}_{ij}(x,y)\,\phi_j(y)\right] = \left|\det\mathcal{M}\right|^{-1} \qquad (1.28)$$

if the fields $\phi$ and $\phi^\dagger$ are bosonic (or commuting) fields and

$$\int \mathcal{D}\bar\psi\, \mathcal{D}\psi\; \exp\left[i\int d^4x\, d^4y\; \bar\psi_i(x)\,\mathcal{M}_{ij}(x,y)\,\psi_j(y)\right] = \left|\det\mathcal{M}\right| \qquad (1.29)$$

for fermionic (anticommuting) fields $\psi$, $\bar\psi$. Hence a simple way to take care of our problem is
to utilize auxiliary anticommuting fields $c$ and $\bar c$ to exponentiate the determinant. These fields
are not physical. They are only mathematical artifacts we introduce to help us through the
day. With this little snag out of the way, we write the QCD lagrangian with covariant gauge
fixing as^20

$$\mathcal{L}_{\rm QCD} = \mathcal{L}^0_{\rm QCD} - \frac{(\partial\cdot A)^2}{2\xi} - \bar c\,\partial\cdot\mathcal{D}\, c . \qquad (1.30)$$



The new anticommuting 'ghosts' are very strange objects. Contrary to appearances, $c$ and
$\bar c$ are completely unrelated complex fields^21 which behave dynamically like scalars. Their sole
function is to account for the Jacobian in Eq. (1.15). If it weren't for the gluon field appearing in
the covariant derivative, they would not interact with any physical field and their fluctuations
would simply shift the energy of the vacuum by a constant amount. These shifts would then
cancel with the fluctuations of the denominator of Eq. (1.13) for any properly normalized
physical matrix element. However, their interaction with $A$ allows an operator $\mathcal{O}$ to polarize
these fluctuations so that they do not cancel in its matrix elements. We will come back to the
story of ghosts later when we consider the renormalization of the gluon fields.

At this point, it is convenient to introduce source terms into our action in order to define
a generating functional for correlation functions:

$$Z[J] \equiv \frac{\int \mathcal{D}[\Phi]\; e^{iS[\Phi, J]}}{\int \mathcal{D}[\Phi]\; e^{iS[\Phi]}} ; \qquad S[\Phi, J] = S[\Phi] + \int d^4x\, J(x)\Phi(x) . \qquad (1.31)$$

I have introduced a source for each field in the theory. Since taking functional derivatives with
respect to the external source $J$ and subsequently taking $J = 0$ can give us any correlation
function,^22 the only object we need to consider is $Z[J]$. This is by no means a trivial task.

20 The factors of i and g go into our overall normalization as before. 

21 The bar over $c$ is merely convention. There is no $\gamma^0$ implied here.

22 Of course, here we must worry about the interchange of functional integration and functional differentia- 
tion. While this interchange is extremely difficult if not impossible to justify, the theory it leads to is the same 
The difficulties associated with QCD stem almost exclusively from an inability to calculate this
object. A whole branch of QCD is dedicated to calculating this quantity numerically. Even in 
this approach there are many problems associated with discretization, finite lattice size, finite 
computing power, etc. Here, we will approach the problem in the same way it is approached 
for QED. Assuming the coupling g to be small, we solve the free theory and take interactions 
into account perturbatively. This method is certainly not without its shortcomings, some of 
which I will try to explain along the way. 

Since we can solve the free theory using Eqs. (1.28) and (1.29), it is advantageous for us to
separate this part from the interaction part. We will do this explicitly for the covariant gauges
since they are the most complicated. The construction is identical in axial gauge (ignoring the
ghosts).

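$$\mathcal{L}_{\rm QCD}[\Phi, J] = \mathcal{L}_{\rm free}[\Phi] + \mathcal{L}_{\rm int}[\Phi] + \mathcal{L}_{\rm source}[\Phi, J] ;$$

$$\mathcal{L}_{\rm free}[\Phi] = \bar\psi\left(i\not\partial - m\right)\psi - \frac{1}{4}\left(\partial_\mu A^a_\nu - \partial_\nu A^a_\mu\right)\left(\partial^\mu A^{a\nu} - \partial^\nu A^{a\mu}\right) - \frac{1}{2\xi}\left(\partial\cdot A^a\right)^2 - \bar c^a \partial^2 c^a ;$$

$$\mathcal{L}_{\rm int}[\Phi] = -g\,\bar\psi \not A^a t^a \psi + g f^{abc} A^a_\mu A^b_\nu\, \partial^\mu A^{c\nu} - \frac{1}{4} g^2 f^{abc} f^{ade} A^b_\mu A^c_\nu A^{d\mu} A^{e\nu} + g f^{abc}\, \bar c^a \partial^\mu A^b_\mu c^c ;$$

$$\mathcal{L}_{\rm source}[\Phi, J] = \bar\eta\psi + \bar\psi\eta + J^a_\mu A^{a\mu} + \bar\zeta^a c^a + \bar c^a \zeta^a . \qquad (1.32)$$

As advertised, the free part of the lagrangian density is quadratic in the fields and the
interaction part vanishes with the coupling.^23 We have argued above that any correlation
function can be obtained by functional differentiation with respect to the sources. This means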

as that obtained by other methods in all cases where there are other methods. Such technical issues are taken 
up to some extent in Jji) . 

23 Of course, this is the reason g was introduced in the first place. 
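that we can get any functional of the fields under the path integral by judicious use of the
source terms. In particular, the functional $\exp\left[iS_{\rm int}[\Phi]\right]$:^24

$$\int \mathcal{D}[\Phi]\; e^{iS[\Phi, J]} = e^{iS_{\rm int}\left[\frac{1}{i}\frac{\delta}{\delta J}\right]} \int \mathcal{D}[\Phi]\; e^{iS_{\rm free}[\Phi] + iS_{\rm source}[\Phi, J]} . \qquad (1.33)$$

This is as far as we can go without getting our hands dirty. At this point, we must do the
path integral. Fortunately, our manipulations have resulted in a problem we can solve. Our
path integral now splits into three integrals - one for each species of particle in our theory
(counting ghosts and antighosts as the same species). Since it is only the source dependence
that matters, we can complete the square using^25

$$\Phi^\dagger\mathcal{M}\Phi + J^\dagger\Phi + \Phi^\dagger J = \left(\Phi^\dagger + J^\dagger\mathcal{M}^{-1}\right)\mathcal{M}\left(\Phi + \mathcal{M}^{-1}J\right) - J^\dagger\mathcal{M}^{-1}J , \qquad (1.34)$$

shift the integration variables, and obtain^26

$$\int \mathcal{D}[\Phi]\; e^{iS_{\rm free}[\Phi] + iS_{\rm source}[\Phi, J]} = e^{-i\int d^4x\, d^4y\; J^\dagger(x)\,\mathcal{M}^{-1}(x,y)\,J(y)} \int \mathcal{D}[\Phi]\; e^{i\int d^4x\, d^4y\; \Phi^\dagger(x)\,\mathcal{M}(x,y)\,\Phi(y)} . \qquad (1.35)$$

Once again, we ignore the infinite shift to the vacuum energy and obtain

$$Z[J] = \frac{e^{iS_{\rm int}\left[\frac{1}{i}\frac{\delta}{\delta J}\right]}\; e^{-i\int d^4x\, d^4y\; J^\dagger(x)\,\mathcal{M}^{-1}(x,y)\,J(y)}}{\left[ e^{iS_{\rm int}\left[\frac{1}{i}\frac{\delta}{\delta J}\right]}\; e^{-i\int d^4x\, d^4y\; J^\dagger(x)\,\mathcal{M}^{-1}(x,y)\,J(y)}\right]_{J=0}} . \qquad (1.36)$$

All that is left is a calculation of $\mathcal{M}^{-1}$ for each species of particle.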
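The completion of the square in Eq. (1.34) is a purely algebraic identity, so it can be checked with ordinary finite matrices. The following is a minimal sketch under stated assumptions: the field and the source are replaced by two-component complex vectors, the matrix is an arbitrary nonsingular (and deliberately non-hermitian) 2x2 matrix, and all numerical values and helper names are hypothetical.

```python
def inverse_2x2(m):
    """Inverse of a nonsingular 2x2 complex matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def mat_vec(m, v):
    """Matrix acting on a column vector."""
    return [sum(m[i][j] * v[j] for j in range(2)) for i in range(2)]

def vec_mat(r, m):
    """Row vector acting on a matrix from the left."""
    return [sum(r[i] * m[i][j] for i in range(2)) for j in range(2)]

def dot(r, v):
    """Row vector times column vector (no complex conjugation)."""
    return sum(ri * vi for ri, vi in zip(r, v))

def dagger(v):
    """Conjugate a column vector into the corresponding row vector."""
    return [z.conjugate() for z in v]

M = [[2.0 + 1.0j, 0.5 - 0.3j], [-0.4 + 0.2j, 1.5 - 0.7j]]
phi = [0.3 + 0.8j, -1.1 + 0.4j]
J = [0.9 - 0.2j, 0.1 + 0.6j]
M_inv = inverse_2x2(M)

# Left-hand side: Phi^dagger M Phi + J^dagger Phi + Phi^dagger J
lhs = dot(dagger(phi), mat_vec(M, phi)) + dot(dagger(J), phi) + dot(dagger(phi), J)

# Right-hand side: (Phi^dagger + J^dagger M^-1) M (Phi + M^-1 J) - J^dagger M^-1 J
row = [p + q for p, q in zip(dagger(phi), vec_mat(dagger(J), M_inv))]
col = [p + q for p, q in zip(phi, mat_vec(M_inv, J))]
rhs = dot(row, mat_vec(M, col)) - dot(dagger(J), mat_vec(M_inv, J))
```

The two sides agree to rounding error for any nonsingular matrix; a singular matrix has no inverse and the shift of variables cannot be performed.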

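We start with the ghost fields since the inversion is simplest for them. The free ghost
matrix is written

$$\mathcal{M}^{ab}_{\rm ghost}(x,y) = -\partial^2\, \delta^{ab}\, \delta^{(4)}(x-y) . \qquad (1.37)$$

By definition, the inverse of this matrix satisfies

$$\int d^4z\, \left(\mathcal{M}^{-1}\right)^{ac}(x,z)\,\mathcal{M}^{cb}(z,y) = \delta^{ab}\, \delta^{(4)}(x-y) . \qquad (1.38)$$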



24 This is why we must fix our gauge before introducing sources. If we did not, we could only couple sources 
to gauge-invariant field combinations. While this is fine for physical quantities, the gauge-dependent separation 
of the action into 'free' and 'interaction' contributions would prevent us from introducing perturbation theory. 

25 This expression is valid for any nonsingular matrix M. This is one reason we must fix our gauge before 
we quantize. 

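26 This result is valid only for complex fields. For hermitian fields we use $\frac{1}{2}\Phi\mathcal{M}\Phi + J\Phi = \frac{1}{2}\Phi'\mathcal{M}\Phi' - \frac{1}{2}J\mathcal{M}^{-1}J$, with $\Phi' = \Phi + \mathcal{M}^{-1}J$.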
Obviously, the $SU(3)$ inverse is trivial: $(\mathcal{M}^{-1})^{ab} \sim \delta^{ab}$. It is the spatial inversion that we
must think about. Since we have not specified a coordinate system, $\mathcal{M}^{-1}$ cannot depend on
one. Hence our inverse matrix is a function only of the difference $x - y$. Performing a Fourier
transform on this variable, substituting the Fourier representation for the $\delta$-function in $\mathcal{M}$,
and inverting the transform we obtain

$$\cdots \qquad (1.39)$$

or

$$\left(\mathcal{M}^{-1}_{\rm ghost}\right)^{ab}(x,y) = \int \frac{d^4k}{(2\pi)^4}\, \frac{\delta^{ab}}{k^2}\, e^{-ik\cdot(x-y)} . \qquad (1.40)$$

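In the other two sectors the calculation is similar, but complicated by the presence of spin
indices.^27 The matrices

$$\left(\mathcal{M}_{\rm quark}\right)_{\alpha\beta}(x,y) = \left(i\not\partial_{\alpha\beta} - m\,\delta_{\alpha\beta}\right)\delta^{(4)}(x-y) ;$$
$$\left(\mathcal{M}_{\rm gluon}\right)^{ab}_{\mu\nu}(x,y) = \left[g_{\mu\nu}\partial^2 - \left(1 - \frac{1}{\xi}\right)\partial_\mu\partial_\nu\right]\delta^{ab}\,\delta^{(4)}(x-y) \qquad (1.41)$$

have inverses^28

$$\left(\mathcal{M}^{-1}_{\rm quark}\right)_{\alpha\beta}(x,y) = \int \frac{d^4p}{(2\pi)^4}\, \frac{\left(\not p + m\right)_{\alpha\beta}}{p^2 - m^2}\, e^{-ip\cdot(x-y)} ;$$
$$\left(\mathcal{M}^{-1}_{\rm gluon}\right)^{ab}_{\mu\nu}(x,y) = \int \frac{d^4q}{(2\pi)^4}\, \frac{-g_{\mu\nu} + (1-\xi)\, q_\mu q_\nu/q^2}{q^2}\, \delta^{ab}\, e^{-iq\cdot(x-y)} . \qquad (1.42)$$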
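The gluon inversion can be verified directly in momentum space, where derivatives acting on $e^{-ik\cdot x}$ become factors of $-ik$. The sketch below assumes the metric diag(+,-,-,-), suppresses the trivial color factor, and uses hypothetical helper names and an arbitrary off-shell momentum. It contracts the kernel $-k^2 g_{\mu\nu} + (1 - 1/\xi)k_\mu k_\nu$ with the candidate inverse $[-g^{\nu\rho} + (1-\xi)k^\nu k^\rho/k^2]/k^2$ and compares the result with the identity.

```python
METRIC = [1.0, -1.0, -1.0, -1.0]  # diagonal entries of g_{mu nu} (and of g^{mu nu})

def lower(k_up):
    """Lower the index of a four-vector: k_mu = g_{mu nu} k^nu."""
    return [METRIC[mu] * k_up[mu] for mu in range(4)]

def gluon_kernel(k_up, xi):
    """M_{mu nu}(k) = -k^2 g_{mu nu} + (1 - 1/xi) k_mu k_nu."""
    k_dn = lower(k_up)
    k2 = sum(k_up[mu] * k_dn[mu] for mu in range(4))
    return [[-k2 * (METRIC[mu] if mu == nu else 0.0)
             + (1.0 - 1.0 / xi) * k_dn[mu] * k_dn[nu]
             for nu in range(4)] for mu in range(4)]

def gluon_inverse(k_up, xi):
    """(M^-1)^{nu rho}(k) = [-g^{nu rho} + (1 - xi) k^nu k^rho / k^2] / k^2."""
    k_dn = lower(k_up)
    k2 = sum(k_up[mu] * k_dn[mu] for mu in range(4))
    return [[(-(METRIC[nu] if nu == rho else 0.0)
              + (1.0 - xi) * k_up[nu] * k_up[rho] / k2) / k2
             for rho in range(4)] for nu in range(4)]

def contract(kernel, inverse):
    """Sum over the repeated index: M_{mu nu} (M^-1)^{nu rho}."""
    return [[sum(kernel[mu][nu] * inverse[nu][rho] for nu in range(4))
             for rho in range(4)] for mu in range(4)]

k = [1.3, 0.2, -0.7, 0.5]   # arbitrary off-shell momentum k^mu
xi = 0.37                   # arbitrary gauge parameter
product = contract(gluon_kernel(k, xi), gluon_inverse(k, xi))
max_deviation = max(abs(product[mu][rho] - (1.0 if mu == rho else 0.0))
                    for mu in range(4) for rho in range(4))
```

The value of `max_deviation` is at the level of rounding error for any nonzero gauge parameter and any momentum with nonzero invariant square.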



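The physical meaning of these objects becomes clear when we study the two-point correlation
functions. For simplicity, we consider the free theory ($g = 0$). The ghost correlation
function

$$\langle T\, c^a(x)\,\bar c^b(y)\rangle = \left[\frac{1}{i}\frac{\delta}{\delta\bar\zeta^a(x)}\left(-\frac{1}{i}\right)\frac{\delta}{\delta\zeta^b(y)}\, Z[\zeta, \bar\zeta]\right]_{\zeta=\bar\zeta=0} = i\left(\mathcal{M}^{-1}_{\rm ghost}\right)^{ab}(x,y) \qquad (1.43)$$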



27 It is here that we run into trouble if we allow $\xi$ to vary with spacetime. In that case, the Fourier
transform does not go through in the same way. Presumably it can be done, but the calculation will be 
horribly complicated. 

28 Here, we have completely suppressed quark flavor and color indices. Barring the difference in quark mass, 
these matrices are proportional to the identity in both spaces. 
is nothing but $i$ times the inverse ghost matrix.^29 This tells us that the amplitude for a ghost
to propagate from $y$ to $x$ is given by $i(\mathcal{M}^{-1}_{\rm ghost})(x,y)$. If $y^0 < x^0$, the ghost is traveling forward
in time and should have positive energy. To see which energies contribute, we attempt to do
the $k^0$ integral via contour. The singularities at $k^0 = \pm|\vec k|$ occur on the integration path so
their meaning is ambiguous. In order to have a well-defined propagator, we must specify some
procedure to handle these singularities. For $y^0 < x^0$, the contour may be closed at infinity in
the lower-half plane and we wish for only positive energies to contribute. This is accomplished
by pushing the poles $k^0 = |\vec k|$ into the lower-half plane and the poles $k^0 = -|\vec k|$ into the
upper-half plane. This physical^30 reasoning leads to the unambiguous propagators

$$\Delta^{ab}(k) = \frac{i\,\delta^{ab}}{k^2 + i\varepsilon} \qquad (1.44)$$

$$S(p) = \frac{i\left(\not p + m\right)}{p^2 - m^2 + i\varepsilon} \qquad (1.45)$$

$$D^{\mu\nu}_{ab}(q) = \left[-g^{\mu\nu} + (1-\xi)\,\frac{q^\mu q^\nu}{q^2 + i\varepsilon}\right]\frac{i\,\delta_{ab}}{q^2 + i\varepsilon} \qquad (1.46)$$

for ghosts, quarks, and gluons^31 of momentum $k$, $p$, and $q$, respectively.

In axial gauges, these arguments go through in exactly the same way. The quark propagator 
is unchanged and the gluon propagator becomes 



g z + is \ n ■ q + m ■ qe [n ■ q + in ■ qe) z 1 



where I have taken $n^0 > 0$. This prescription is motivated in exactly the same way as above,^32
but here we cannot maintain frame independence. This unpleasing turn of events is not

29 The signs in this calculation are somewhat tricky. We must remember that the ghost and quark fields and 
sources are Grassmann variables, and as such anticommute rather than commute. 

30 Since the ghosts are not physical particles, the motivation for the use of this prescription is not clear in 
this context. However, it is essential for the ghost propagator to have the same form as the gluonic propagator 
if the ghosts are to do their job properly. 

31 Strictly speaking, the $+i\varepsilon$ in the second term of the gluon propagator is not required by physics any more
than that in the ghost propagator. These prescriptions need only be consistent with each other to ensure a
well-defined gauge theory. It is only a matter of aesthetic convention that we treat them in the same way as
the physical propagators.

32 This prescription is appropriate for ordinary time-ordered perturbation theory, in which one imposes
quantization conditions on the fields and their conjugate momenta at equal times. In practice, one often uses
the axial gauge with light-like $n$ ($n^2 = 0$) and quantizes the theory at equal $n\cdot x$. In this case, the proper
unexpected since our gauge condition has already broken Lorentz invariance. Of course, the
terms in which we need this prescription are merely gauge artifacts and are not required to
satisfy the constraints put on physical propagation. Since there are no ghosts in this gauge,
there is nothing we must be consistent with. For this reason, the complicated prescription
suggested by physical arguments is often ignored. This leads to singularities of a very special
kind which are required by gauge invariance to cancel in physical quantities. If they are
handled properly, this does indeed happen.^33 However, the special care one must take in the
axial gauge can lead to tremendous difficulties. This is one reason the covariant gauges are
often employed in spite of the complications due to ghosts.

Although these $i\varepsilon$'s are quite important in many aspects of quantum field theory, there
are many other aspects in which they are not essential. For this reason, I will not write them 
explicitly unless they are expected to contribute. However, it is understood that they are 
always there. 



1.3 Feynman Rules and Perturbation Theory 

The free theory is nice, but it cannot be expected to teach us the true nature of the strong 
interactions. At some point, we must allow our fields to interact with one another. Aside from 
the mathematical difficulties associated with defining the path integral, everything we have 
done up to now is exact. In order to go further, we must make some approximations. 

As it stands, the generating functional is expressed as a series of derivatives acting on a 
product of exponentials. Since we cannot calculate the infinite number of derivatives necessary 
to take the full exponential into account, let us assume for the moment that the coupling g is 
sufficiently small to justify an expansion : 



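$$e^{iS_{\rm int}\left[\frac{1}{i}\frac{\delta}{\delta J}\right]} \sim 1 + iS_{\rm int}\left[\frac{1}{i}\frac{\delta}{\delta J}\right] . \qquad (1.49)$$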



prescription is given by

$$\frac{1}{n\cdot q} \to \frac{n^*\cdot q}{(n^*\cdot q)(n\cdot q) + i\varepsilon} , \qquad (1.48)$$

where $n^{*\mu}$ satisfies $n^*\cdot n = 1$. This rule for handling the singularity, due to Mandelstam and Leibbrandt |pS|,
ensures that only those particles with positive 'light-cone energy', $n\cdot q$, will propagate forward in 'light-cone
time', $n\cdot x$. We will see more of this prescription when we study the axial gauge in Section

33 The term 'handled properly' is in some sense defined by this statement. 
Figure 1.1: The fundamental quark-gluon interaction is represented graphically by a Feynman
diagram. The factor associated with the emission of a gluon with polarization $\mu$ and color
index $a$ is $-igt^a\gamma^\mu$, along with a $\delta$-function to ensure momentum conservation.

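Consider the correlation functional^34

$$\langle T\,\bar\psi^i_\alpha(x_1)\,\psi^j_\beta(x_2)\,A^a_\mu(x_3)\rangle = \left[\left(-\frac{1}{i}\right)\frac{\delta}{\delta\eta^i_\alpha(x_1)}\,\frac{1}{i}\frac{\delta}{\delta\bar\eta^j_\beta(x_2)}\,\frac{1}{i}\frac{\delta}{\delta J^{a\mu}(x_3)}\, Z[\eta, \bar\eta, J]\right]_{\eta=\bar\eta=J=0} . \qquad (1.50)$$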

In the free theory, this is zero since the external gluon has nowhere to propagate. The way that
sources appear in the exponentials implies that there must be an even number of external gluon
fields and the same number of $\psi$'s as $\bar\psi$'s in order for a matrix element to be nonzero. Taking
into account one iteration of the interaction, we see that our correlation function becomes

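$$\begin{aligned}
\langle T\,\bar\psi^i_\alpha(x_1)\,\psi^j_\beta(x_2)\,A^a_\mu(x_3)\rangle_{\rm vac} = {} & \left(-\frac{1}{i}\right)\frac{\delta}{\delta\eta^i_\alpha(x_1)}\,\frac{1}{i}\frac{\delta}{\delta\bar\eta^j_\beta(x_2)}\,\frac{1}{i}\frac{\delta}{\delta J^{a\mu}(x_3)} \\
& \times (-ig)\int d^4w\, \left(-\frac{1}{i}\right)\frac{\delta}{\delta\eta^k_\gamma(w)}\, \gamma^\nu_{\gamma\delta}\,(t^b)^{kl}\, \frac{1}{i}\frac{\delta}{\delta\bar\eta^l_\delta(w)}\,\frac{1}{i}\frac{\delta}{\delta J^{b\nu}(w)} \\
& \times e^{-i\int d^4y_1\, d^4y_2\; \bar\eta(y_1)\, S(y_1,y_2)\, \eta(y_2)} \\
& \times e^{-\frac{i}{2}\int d^4z_1\, d^4z_2\; J^\mu(z_1)\, D_{\mu\nu}(z_1,z_2)\, J^\nu(z_2)}\,\Big|_{\eta=\bar\eta=J=0} . \qquad (1.51)
\end{aligned}$$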

Since the sources are set to zero at the end of the calculation, all points must be coupled 
through propagators. There are only two ways to do this, one of which is zero since the 
generators of SU (3) are traceless. The other gives 

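$$\langle T\,\bar\psi^i_\alpha(x_1)\,\psi^j_\beta(x_2)\,A^a_\mu(x_3)\rangle_{\rm vac} = \int d^4w\; S^{jk}_{\beta\gamma}(x_2, w)\left(-ig\,\gamma^\nu_{\gamma\delta}\,(t^b)^{kl}\right) S^{li}_{\delta\alpha}(w, x_1)\; D^{ba}_{\nu\mu}(w, x_3) . \qquad (1.52)$$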



34 $i$ and $j$ refer to quark color while $\alpha$ and $\beta$ refer to spin. Here and in the following, I will ignore the difference
between the fundamental and anti-fundamental representations of $SU(3)$. There are simply too many indices
floating around here for us to be particular about placement. One can always trace through the derivation to 
resolve any confusion. 
Figure 1.2: This diagram presents a prime example of the reason symmetry factors are sometimes
necessary to avoid overcounting. The Feynman rule for the triple gluon vertex counts
each of these contributions as separate. In reality, however, only (a) and (b) represent different
processes. In order to obtain the correct result using Feynman rules, one must include a factor
of 1/2.

This expression consists of three propagators, each from an external point to the common point 
w. We would like to split it up into different pieces - one for each propagation and one for the 
interaction itself. However, all of these pieces are at present entangled by the integration over 
w. In momentum space, the separation is clear : 

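$$\langle T\,\bar\psi^i_\alpha(p_1)\,\psi^j_\beta(p_2)\,A^a_\mu(p_3)\rangle_{\rm vac} = S(p_2)\left(-ig\,t^b\gamma^\nu\right)S(p_1)\;\delta^{(4)}(p_2 + p_3 - p_1)\; D^{ba}_{\nu\mu}(p_3) . \qquad (1.53)$$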

The factor associated with the interaction is just $-igt^b\gamma^\nu$ along with a $\delta$-function enforcing
energy-momentum conservation. This interaction is represented graphically in Figure 1.1. The
straight lines represent quark propagation and the squiggly ones represent gluon propagation.^35
Diagrams such as this one are called Feynman diagrams and are extremely useful for organizing 
perturbation theory. 

The above method can be used to obtain a set of rules, called 'Feynman rules', for all of the 
terms in the interaction lagrangian. One simply considers a correlation with an external field 

35 On these diagrams, ghost propagation is represented by a dashed line. 
for each particle species in the interaction. The rules for all of the interactions in QCD with
covariant gauge fixing are given in Appendix A.4. Feynman rules for axial gauge calculations
are identical except for the gluon propagator, (1.47), and the fact that there are no ghosts.
With these rules, one can in principle calculate any process to any order in perturbation theory:

1) One draws a propagator for each external field in the correlation and connects them in 
every way possible according to the rules. 

2) Disconnected diagrams are discarded, as they will cancel with the normalization. 

3) Each interaction vertex comes with an associated $\delta$-function guaranteeing momentum
conservation. 

4) Unconstrained momenta are integrated over, like every other unconstrained quantum 
number in the diagram. The number of such integrations is called the number of loops. 

5) The anticommuting nature of Grassmann fields generates an extra minus sign for each 
loop associated with these fields. 

6) Certain diagrams come with extra factors because of their high degree of symmetry. 
These 'symmetry factors' arise because of an inadequacy in our Feynman rules. In the derivation
of these rules we assume that the external fields are all distinct. While this is certainly
the case if the fields are truly external, it need not be true for internal fields. Consider, for 
example, the diagrams of Figure 1.2. Our rules count each separate contraction as a distinct 
contribution to the diagram. However, there are really only two distinct contributions; Figs. 
1.2a and d are identical, as are Figs. 1.2b and c. Our rules have overcounted the contribution 
of this diagram by a factor of two. In general, symmetrical diagrams must be divided by the 
number of ways one can interchange propagator lines without changing the diagram. 
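The counting in rule 4 can be made explicit. Each internal propagator carries one momentum integration, each vertex $\delta$-function removes one, and one $\delta$-function per connected piece is left over to enforce overall momentum conservation. A minimal sketch of this bookkeeping follows; the helper name `loop_count` and the example diagrams are illustrative assumptions, and external (amputated) legs are not counted as internal lines.

```python
def loop_count(internal_lines, vertices, connected_pieces=1):
    """Number of unconstrained momentum integrations in a diagram.

    Each internal line contributes one integration and each vertex one
    delta-function, except that one delta-function per connected piece
    only enforces overall momentum conservation.
    """
    if internal_lines < 0 or vertices < 0 or connected_pieces < 1:
        raise ValueError("counts must be non-negative, with at least one connected piece")
    return internal_lines - vertices + connected_pieces

# Tree-level quark-gluon vertex: no internal lines, one vertex.
tree_vertex = loop_count(internal_lines=0, vertices=1)

# One-loop quark self-energy: one internal quark line, one internal gluon line, two vertices.
one_loop_self_energy = loop_count(internal_lines=2, vertices=2)

# A self-energy diagram built from four cubic vertices has five internal lines.
two_loop_example = loop_count(internal_lines=5, vertices=4)
```

The three examples evaluate to zero, one, and two loops respectively.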

All of these rules arise naturally from the path integral. One can always resort to the 
tedious method above to ensure the correct symmetry factor or minus sign. 



1.4 Renormalization in Covariant Gauges 

At this point, it seems like we have all the ingredients necessary for a calculation in QCD. 
Let's test this hypothesis by applying our procedure to the quark propagator, $\langle T\,\psi(y)\bar\psi(x)\rangle$.
This object receives several kinds of corrections, as shown in Fig. 1.3. The diagram in Fig. 
1.3a has two disconnected pieces. The gluon loop is not associated with the external sources 
and therefore cancels with the denominator in Eq. (1.31). Such disconnected pieces are called
vacuum bubbles and represent shifts in the vacuum energy. For our purposes, they can be
completely ignored.^36 Diagrams like the one in Fig. 1.3b are called one-particle-reducible
(1PR) since they can be broken into two disconnected pieces by cutting only one propagator.
These diagrams are special because the momentum of the 'one-particle' propagator is fixed 
completely by the conservation law. This means that the two parts of this diagram are not 
coupled by a loop integration and they may be evaluated separately. Hence one can consider 
only one-particle-irreducible (1PI) diagrams, such as those in Fig. 1.3c, and string them 
together to get the effect of the 1PR diagrams. 

Calling the 1PI quark self-energy $-i\Sigma$, we see that the full correction is^37

$$S(p) = \frac{i}{\not p - m}\left(\frac{1}{1 - \Sigma(p)\left(\not p - m\right)^{-1}}\right) , \qquad (1.55)$$

where I have summed the geometric series. In covariant gauge, Lorentz invariance tells us that
$\Sigma$ can depend only on $p^\mu$. Since the strong interactions respect parity and there is only one
vector in the problem, we may express $\Sigma$ in terms of two scalar functions of $p^2$:

$$\Sigma(p) = A(p^2) + B(p^2)\,\not p . \qquad (1.56)$$

The explicit breaking of Lorentz invariance in the axial gauges destroys the validity of this
expression. The special techniques necessary to handle this and the other problems with
renormalization in these gauges will be discussed in Section 1.6.

Substituting this expression for $\Sigma$ into the fully re-summed propagator, we obtain

$$iS^{-1}(p) = \left(1 - B(p^2)\right)\left(\not p - m\right) - A(p^2) - mB(p^2) . \qquad (1.57)$$
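The geometric series behind the re-summed propagator can be checked numerically. The sketch below makes several simplifying assumptions that are not part of the text: since the self-energy commutes with $\not p$, the matrix $\not p$ is replaced by one of its eigenvalues `s` (with $s^2 = p^2$), the functions $A$ and $B$ are treated as small constants `a` and `b`, and all numerical values and function names are hypothetical. It sums the chain of self-energy insertions $-i\Sigma$ between bare propagators $i/(s - m)$ and compares the result with the closed form.

```python
def bare_propagator(s, m):
    """Bare propagator i / (s - m), with s standing in for an eigenvalue of p-slash."""
    return 1j / (s - m)

def self_energy(s, a, b):
    """Toy self-energy Sigma = A + B * p-slash with constant A and B."""
    return a + b * s

def dyson_partial_sum(s, m, a, b, n_terms):
    """Sum S0 + S0 (-i Sigma) S0 + S0 (-i Sigma) S0 (-i Sigma) S0 + ... term by term."""
    s0 = bare_propagator(s, m)
    insertion = -1j * self_energy(s, a, b)
    term, total = s0, 0j
    for _ in range(n_terms):
        total += term
        term = term * insertion * s0
    return total

def resummed_propagator(s, m, a, b):
    """Closed form of the geometric series: i / (s - m - Sigma)."""
    return 1j / (s - m - self_energy(s, a, b))

s, m, a, b = 2.0, 1.0, 0.05, 0.02
series = dyson_partial_sum(s, m, a, b, 40)
closed = resummed_propagator(s, m, a, b)

# i S^{-1} should equal (1 - B)(s - m) - A - m B.
inverse_check = 1j / closed
expected = (1.0 - b) * (s - m) - a - m * b
```

The series converges here because the ratio of successive terms, $\Sigma/(s - m)$, is small for the chosen numbers; near the bare pole the individual terms grow and only the re-summed form is meaningful.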

Unless $A(p^2)$ and $B(p^2)$ are rather trivial functions near $p^2 = m^2$, the corrected propagator
no longer has a pole at this position and the residue at the shifted pole is no longer $i$. The

36 Although they will not be useful for us, these diagrams do contribute to the effective action of the field 
theory. Analysis of this object gives insight into the vacuum structure of a theory. 

37 Here, we resort to the standard notation

$$\frac{i}{\not p - m} \qquad (1.54)$$

for the bare (uncorrected) quark propagator rather than that found above. This is understood to mean
$i(\not p - m)^{-1}$. The $i\varepsilon$ will be ignored for a while, but it is always understood to be present.
Figure 1.3: Three different kinds of contributions to the quark self-energy. The gluon loop in
(a) is uncoupled to the sources and so contributes only to the vacuum energy. The reducible
graph in (b) does not contain any new information; its contributions can be re-summed with
a knowledge of irreducible diagrams, like that in (c).

normalization of the field is a convention, but the fact that it has changed is disconcerting. 
Although arbitrary, this normalization should be consistent. The resolution of this apparent 
difficulty rests with the fact that we cannot observe the fields present in our lagrangian. 
Turning off the interaction so that we can measure this quantity m in the lagrangian is not an 
option.^38 Hence we are forced to let go of any idea we have that the parameter $m$ present in the
lagrangian is a physical observable and that the field $\psi$ is the quark field we see in experiment.
These quantities are called 'bare' or 'unrenormalized' objects and require corrections before 
we can relate them to experiment. 

We have the corrected propagator in terms of $A$ and $B$, so we can calculate the shift in the
mass and the new normalization of the quark field. The new pole appears at a renormalized
mass $m_R$ such that

$$\left(1 - B(m_R^2)\right)\left(m_R - m\right) - A(m_R^2) - mB(m_R^2) = 0 , \qquad (1.58)$$

and the residue at this pole is given by

$$Z_F = \frac{1}{i}\,{\rm Res}\left[S(p), m_R\right] = \frac{1}{1 - B(m_R^2) - 2m_R\, d\left(A(p^2) + mB(p^2)\right)/dp^2} . \qquad (1.59)$$



38 Of course, in QCD seeing the objects under consideration at all is not an option. As mentioned in the 
introduction, defining quark 'mass' is itself difficult. However, we will ignore these difficulties for the discussion
at hand as they are nonperturbative problems which do not affect our calculations in perturbation theory. For
the rest of this chapter, we can ignore confinement completely. The penalty we pay for such arrogance will be
discussed in the next chapter.

Since we have no information on the values of these bare parameters, it is not useful for us to
express predictions in terms of them. We would prefer to know the variation of observables 
with physical quantities. Removing unphysical parameters in favor of physical quantities is 
called renormalization. It turns out that every term in our lagrangian requires renormalization. 

The idea of renormalization is not new. Solid state physics has been using this process for 
many years. Electrons in the conduction band of a metal behave as though their masses are 
different since the actual excitations observed involve the background lattice. The idea here 
is the same: we are observing excitations on top of a background vacuum. The vacuum in
quantum field theory is a complicated sea of quantum fluctuations and cannot be treated as 
trivial. The only difference between the solid state example and renormalization in a quantum 
field theory is that electrons in a metal can be removed from the metal and observed free of any 
background lattice. They can never be removed from their electromagnetic field fluctuations 
and observed in their bare form. This is the underlying reason bare quantities have no physical 
meaning. 

There are two main ways to view renormalization. One way is to simply calculate everything 
in terms of bare parameters and substitute known expressions for the physical quantities at 
the end. This is known as unrenormalized perturbation theory. The unpleasing thing about
unrenormalized perturbation theory is the fact that once the calculation is done, one still 
must do the substitution - something which is certainly not always trivial. The other main 
way to renormalize involves a systematic replacement of bare parameters with renormalized 
ones at each order in perturbation theory. This renormalized version has the advantage that 
unrenormalized quantities never appear in our expressions. Although it will lead to more 
diagrams, it is generally preferred to the procrastinator's method. 

Before we can begin the task of renormalization, we must carefully define our quantities. 
We saw above that the normalization of the fermion field changes so that the residue of the 
pole becomes $iZ_F$. Consistency of convention requires the residue for the physical field to be
$i$. This can be obtained by defining$^{39}$ $\psi = Z_F^{-1/2}\psi_b$ since

$$ \int d^4x\, e^{ip\cdot x}\left\langle T\,\psi(x)\bar\psi(0)\right\rangle_{\rm vac} = Z_F^{-1}\int d^4x\, e^{ip\cdot x}\left\langle T\,\psi_b(x)\bar\psi_b(0)\right\rangle_{\rm vac} = i\left[\slashed{p} - m - M(p)(\slashed{p} - m)^2\right]^{-1} \; , \tag{1.60} $$

where $M(p)$ is defined to take into account the rest of the $p^2$ variation in $A$ and $B$. $M(p)$ will
have nontrivial Dirac structure, but be regular as $\slashed{p} \to m$. The normalization of the two-point
correlator is governed by the normalization of the kinetic term in the lagrangian, so one might
think of this renormalization constant as coming from the operator $\bar\psi\, i\slashed{\partial}\,\psi$. It is also associated
with the field itself. If we were dealing with a nonlocal field theory, this constant is the only
renormalization we would need since nonlocal products of fields are renormalized simply by
products of $Z_F$. Local field theories are different because the structure of a local composite
operator is quite different from that of a nonlocal product. This will become quite explicit in
our discussion of the coupling.

39 From this point on, I will use the subscript b for bare parameters and leave renormalized parameters with
no subscript.

The gluon and ghost fields also require a field-strength renormalization. We define $A =
Z_A^{-1/2}A_b$ and $c = Z_G^{-1/2}c_b$.$^{40}$ There is also a relationship between the renormalized mass
and the bare mass. Since the mass term is the only piece of our lagrangian which couples
quarks of different chirality, a theory in which $m_b = 0$ is fundamentally different from a theory
in which $m_b \neq 0$.$^{41}$ A theory of massless fermions cannot generate a fermion mass. For this
reason, the renormalized mass must be proportional to the bare mass. We write $m = Z_m^{-1}m_b$.
The renormalization of the coupling must also be multiplicative since a theory with zero bare
coupling is free and certainly will not generate a coupling. Hence we define $g = Z_g^{-1}g_b$. We will
see below that even the gauge parameter $\xi$ acquires multiplicative renormalization, $\xi = Z_\xi^{-1}\xi_b$.

How does all of this renormalization change our Feynman rules? Obviously, it doesn't 
change them at all in the unrenormalized version of perturbation theory since all the action 
happens after the calculation. However, in order to calculate quantities in renormalized per- 
turbation theory, we will have to make some modifications. We begin by making the trivial 

40 Although the ghost and anti-ghost field renormalizations need not be the same since they are unrelated 
fields, they will always appear together in our applications. Hence for our purposes only the product of their 
field strengths has any meaning. 

41 Theories with small fermion masses can be approximated by massless theories. The effects of the small 
breaking of this chiral symmetry induced by nonzero fermion masses can be taken into account in perturbation 
theory. This is the idea behind Chiral Perturbation Theory, the theory of pions and nucleons mentioned in the 
introduction.

Figure 1.4: The one-loop correction to the quark self-energy.

observation that 

m&, = £[$,ra,g,£] 

f 

i 

-i (z A - i) (fl^ - - a^"") 

-(Za^ 1 -!)^!! (1.61) 
+ l -g(Z g zT - l)r bc A;Ald»A c » 
-\g 2 {Zlz\ - l)f abc f ade A b »A cv A c lAl 
-(Z G -l)c a d 2 c a 

+g(Z g Z G Z 1 ^ 2 ~l)r bc c a d^A b l c c . 

This is nothing more than a simple rearrangement of terms after substitution. The Feynman 
rules are the same for the first line of this expression since the lagrangian is the same. All of 
the other terms are intended to be taken into account in perturbation theory. These terms are 
all proportional to g since the free theory has Z = 1 for all renormalization constants. Hence 
each of these terms will simply give us another interaction vertex to iterate in our Feynman 
diagrams. This is the tragedy of renormalized perturbation theory. The price of systematically 
replacing each bare parameter with the physical couplings is eight new interaction vertices. 
This is twice as many as we started with! The extra terms are called counterterms because 
they are constructed to cancel certain contributions which arise in our calculation. For the 
remainder of this section we will be concerned with calculating their coefficients at leading 
order in perturbation theory.

Let us begin by actually calculating $A(p^2)$ and $B(p^2)$. There is only one diagram which
contributes to these quantities at next-to-leading order (see Fig. 1.4). Using the Feynman
rules in Appendix A.4, we find the amplitude for this diagram to be$^{42,43}$



(1.63) 



v^-^o P )( n \- in %r ^ ( d k ^~ ft + m b)[-g^ + - 0{k„K/k-)} ^ 

A closer look reveals this integral to be divergent. For large $k$, it behaves as $\int dk/k$.$^{44}$ This
tells us that extremely large momenta are making contributions to the process. The fact that
these asymptotically large momenta contribute to this process irrespective of the size of the
characteristic scales in the problem, $p^2$ and $m_b$, could imply a deep problem with the theory. It
is divergences such as this one which turned people away from quantum field theory at the end
of the 1960's. We need not give up so quickly, though. We have just seen that this quantity
means nothing by itself. Only properly normalized physical matrix elements expressed in
terms of physical quantities are observable. Hence in a strict sense it is unimportant that this
quantity diverges.$^{45}$ It implies only that the bare parameters must also diverge.

While the above discussion is fine as a theoretical argument, we still have not obtained 
expressions for the quantities $A$ and $B$ (or $Z_F$ and $Z_m$). In some sense, we can write $A =
1 + g^2\,\infty_1 + O(g^4\,\infty_2)$ with a similar expression for $B$. However, these expressions are not
very useful and certainly bring doubts about the convergence of our perturbative expansion. 



42 Remember that the external propagators are not included in the definition of $\Sigma$!

43 The Casimir $C_F\,\delta_{ij} = (t^at^a)_{ij}$ is discussed in the appendix.

44 Since the vector index must be carried by the only vector in the problem, $p^\mu$, the linear divergence $\int dk$ is
thrown away by symmetry.

45 In fact, from a theoretical point of view we might expect it to diverge. If it didn't, the bare parameters
of the theory would be well-defined finite objects. However, we can think of a bare parameter as the way
the particle looks extremely close-up, i.e. at asymptotically high energy. Since there are always corrections at 
a higher scale than we are observing at, we might not be surprised that there is an infinity behind them all. 
Looking at renormalization in this way, it also seems quite natural that the divergences from asymptotically 
large loop momentum combine with the divergent bare parameters representing the way the particles look at 
asymptotically large momentum transfer to give sensible finite results for physical momentum transfer. 






In order to proceed in a constructive way, we must make sense of these divergences. We have 
already argued that the fact that they diverge is unimportant, but now we need to find a way 
to express how they diverge. The important thing here is that we cannot manipulate infinite
expressions.$^{46}$ It makes no sense at all to write a relation like $\infty_1 - \infty_2 = 3$. However, it is
perfectly correct and meaningful (mathematically) to say $\lim_{x\to\infty}(x + 3 - x) = 3$. What
we need to make sense of our theory is a regulator - something to make the divergences finite
until we take a certain limit, which we will choose to postpone until the end of the calculation,
so that we can manipulate them. As we will see, this necessarily introduces a scale which
will act as the separation scale between physics we calculate explicitly and physics contained
in our renormalized couplings.$^{47}$ Physical processes cannot depend on this arbitrary scale of
separation, but intermediate parameters can and will gain dependence on it. 
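
The role of a regulator can be illustrated numerically. The sketch below is not the quark self-energy; it assumes a toy logarithmically divergent integral, $I(\Lambda, a) = \int_0^\Lambda dk\, k/(k^2 + a^2)$, with a hypothetical cutoff `cutoff` standing in for the regulator and illustrative function names. Each regulated integral grows without bound, but the difference of two of them has a finite limit, in the same way that $\lim_{x\to\infty}(x + 3 - x) = 3$ is meaningful.

```python
import math

def regulated_integral(cutoff: float, a: float) -> float:
    """Closed form of the toy integral int_0^cutoff dk k / (k^2 + a^2)."""
    return 0.5 * math.log(1.0 + (cutoff / a) ** 2)

def regulated_difference(cutoff: float, a: float, b: float) -> float:
    """Difference of two regulated integrals; finite as cutoff -> infinity."""
    return regulated_integral(cutoff, a) - regulated_integral(cutoff, b)

if __name__ == "__main__":
    a, b = 1.0, 3.0  # illustrative mass scales (assumption)
    for cutoff in (1e2, 1e4, 1e6):
        # First column diverges logarithmically, second column converges.
        print(cutoff, regulated_integral(cutoff, a), regulated_difference(cutoff, a, b))
    print("limit of the difference:", math.log(b / a))
```

The cutoff only appears in intermediate quantities; the limit of the difference, $\log(b/a)$, depends on the physical scales alone.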

How can we regularize the theory? One simple answer is to merely introduce a high-momentum
cutoff in our integrals. Instead of integrating to infinity, we could integrate only
to $\Lambda$. This procedure has been used before in several theories. However, in gauge theories the
price we pay for such savage butchery is high. If we use this regulator, our theory is no longer
renormalizable. In unrenormalizable theories, divergences appear which cannot be re-absorbed
into the constants we have introduced. For example, if the one-loop quark self-energy in Fig.
1.4 contained a divergence proportional to $p^2$, we would require a counterterm of the form
$\bar\psi\partial^2\psi$. In the absence of such a term, the theory is not complete. Unrenormalizable theories
need an infinite number of terms in order to be complete. Unless there is a good perturbative
expansion which can be truncated, an infinite number of experiments need to be done to fix the
renormalized coefficients of each term in the lagrangian. This is far from practical, especially
since the theory we began with has only $n_f + 1$ parameters - the quark masses and the coupling,
$g$. The price of the simple cutoff regulator is just too high for us to pay.

46 There are several schemes which attempt to avoid regularization entirely, defining counterterms during the 
calculation as sums of divergent integrals in such a way that the renormalized quantities are well-defined (see 
[10]). It is not clear that the manipulations used to define these schemes are applicable to arbitrary order or
for an arbitrary process. At any rate, the scheme introduced below is easier to use and avoids these problems 
entirely. 

47 This should be obvious from the footnote above on the divergence of the bare parameters. We cannot say that
large momentum quantities combine to give a moderate momentum result without referring to a moderate momentum.

The reason this cutoff procedure exacts such compensation is that renormalizability is
not an easy thing to come by - especially in a theory with vector bosons like this one. In
fact, the only renormalizable theories involving spin-1 fields are gauge theories. The gauge
invariance of these theories plays such a crucial role in the proof$^{48}$ that it no longer works if
the method of regulation breaks the invariance.$^{49}$ Our simple cutoff idea butchers the theory
with no respect for either gauge invariance or Lorentz invariance. In fact, there are only 
a few regularization schemes which preserve the gauge invariance of our theory. The one
which has proven to be the most useful in many applications, including this one, is called
Dimensional Regularization (DR). This scheme was introduced by 't Hooft and Veltman in
1972 expressly because it does not violate gauge invariance. Many of the technical details of
DR are spelled out in the appendix. The basic idea is that the integral in Eq. (1.63) is only
divergent because we live in 4 dimensions. If we lived in 3 dimensions, for example, it would
be perfectly finite as $k \to \infty$. Furthermore, the theory itself is expected to be well-defined
in any number of dimensions and its amplitudes are expected to be meromorphic functions
of the spacetime dimension. These assertions, along with a set of rules that are outlined in
the appendix, allow us to calculate matrix elements in an arbitrary number of dimensions, $d$.
Our divergences will show up as poles in $d - 4$. It is only at the end of the calculation that we
evaluate these (physical) matrix elements at the spacetime dimension $d = 4$.
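
A rough power-counting sketch shows why lowering the dimension tames the integral and why the divergence reappears as a pole in $d - 4$. It assumes only the large-$k$ behavior quoted above, an integrand falling like $1/k^4$ with radial measure $k^{d-1}dk$, and a hypothetical lower limit $k = 1$ in arbitrary units; the name `uv_tail` is illustrative.

```python
import math

def uv_tail(d: float, cutoff: float) -> float:
    """Integral of k^(d-1) / k^4 from k = 1 to k = cutoff (radial UV tail)."""
    power = d - 4.0  # integrand is k^(d-5); antiderivative is k^(d-4) / (d-4)
    if abs(power) < 1e-12:
        return math.log(cutoff)  # d = 4: logarithmic divergence
    return (cutoff ** power - 1.0) / power

if __name__ == "__main__":
    for cutoff in (1e2, 1e4, 1e6):
        # d = 3 saturates at a finite value, d = 4 keeps growing.
        print(cutoff, uv_tail(3, cutoff), uv_tail(4, cutoff))
```

For $d < 4$ the tail converges to $1/(4 - d)$ as the cutoff is removed, which is the toy version of a pole in $d - 4$.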

The scale we must introduce along with the regularization procedure is mysteriously absent. 
This is a good thing since the explicit appearance of scales is notorious for breaking gauge 
invariance. However, as argued above, there must be a scale introduced somewhere. It turns 
out that the scale in DR is hidden in the coupling constant. To see this, let us analyze QCD 
in d dimensions. 

The mass-dimensions of quantum fields are obtained from the requirement that the action
$S = \int d^dx\,\mathcal{L}$ is dimensionless, implying that $\mathcal{L}$ has mass dimension $d$. Since derivatives have

48 This proof is the work that won the Nobel Prize in physics for 't Hooft and Veltman mentioned in the 
introduction. The relevant references are [11].

49 It can be shown that certain amounts of violence can be done to the gauge invariance before the theory 
actually becomes unrenormalizable. What is required is that the gauge invariance can be restored by adding 
a finite number of new counterterms to the lagrangian. Even so, a procedure in which one need not add any 
terms to the lagrangian (one which does not break the symmetry) is desirable.

mass dimension 1, the kinetic terms in the lagrangian fix the dimension of the fields: $(d - 2)/2$
for bosons and $(d - 1)/2$ for fermions. Applying this result to the rest of the lagrangian, we
see that the coupling constant $g$ has mass dimension $(4 - d)/2$. We would rather work with a
dimensionless coupling, so we introduce a scale, $\mu$, to carry the dimension required. The bare
coupling must also be given dimension, $\mu_0$. Going back to the bare lagrangian and introducing
Z-factors again, we see that $\mu_0$ goes along for the ride; $g$ must be replaced with $g\mu_0^{(4-d)/2}$
everywhere in the renormalized lagrangian. The reason that $\mu_0$ appears rather than $\mu$ stems
from the fact that $Z_g$ relates the dimensionless couplings $g$ and $g_b$ rather than the dimensioned
coefficients $g\mu^{(4-d)/2}$ and $g_b\mu_0^{(4-d)/2}$. We will see how $\mu_0$ is replaced by $\mu$ below.
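
The dimension counting above can be checked mechanically. The following sketch assumes only the statements in the text (action dimensionless, derivatives of mass dimension 1) and reads the coupling dimension off two different interaction terms as a cross-check; the function names are illustrative.

```python
from fractions import Fraction

def field_dimensions(d) -> dict:
    """Mass dimensions fixed by the kinetic terms in d spacetime dimensions."""
    d = Fraction(d)
    return {"boson": (d - 2) / 2, "fermion": (d - 1) / 2}

def coupling_dimension(d) -> Fraction:
    """Dimension of g from the quark-gluon term g * psibar * A * psi."""
    dims = field_dimensions(d)
    return Fraction(d) - 2 * dims["fermion"] - dims["boson"]

def coupling_dimension_three_gluon(d) -> Fraction:
    """Cross-check from the three-gluon term g * (dA) * A * A."""
    dims = field_dimensions(d)
    return Fraction(d) - 1 - 3 * dims["boson"]

if __name__ == "__main__":
    for d in (2, 3, 4, 5, 6):
        print(d, coupling_dimension(d), coupling_dimension_three_gluon(d))
```

Both terms give $(4 - d)/2$, so a single scale raised to that power is enough to make the coupling dimensionless in every vertex.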

In principle, we are really only interested in $d = 4$, where $g$ is dimensionless and the $\mu$ (and
$\mu_0$) dependence drops out. However, a consistent theory in $d$ dimensions requires the inclusion
of $\mu$. Hence all of our final results should be independent of $\mu$, but its introduction is required
by our regulator. A study of the $\mu$ dependence of certain intermediate results leads to a deep
understanding of renormalization and quantum field theory itself. 

Armed with a regularization procedure, we are finally ready to calculate our Z-factors 
explicitly. Returning to the quark self-energy, this time in $d$ dimensions, we write$^{50}$

Using the integration formulae in Appendix A.3

50 To obtain this result, we have used the fact that any term in which the quark propagator $(p - k)^2 - m^2$
cancels is zero since it does not have any scale dependence. This argument is made in considerably more detail
in the appendix. The rules for $d$-dimensional $\gamma$ matrix algebra can be found in Appendix A.

51 Since $g$ appears almost exclusively in the combination $g^2/4\pi$, I have introduced $\alpha_s = g^2/4\pi$. This is the
true expansion parameter of the strong interaction.

where I have introduced $\epsilon = 4 - d > 0$ for convenience. At this point, it seems that finding
expressions for $Z_F$ and $Z_m$ is merely a matter of substitution. In fact, since we only work to a
certain order in perturbation theory, this process is easier than it appears at first. Looking at
Eq. (1.58), we see that $m$ is in fact defined implicitly. This could present problems - especially
in view of the involved form of $A(p^2)$ and $B(p^2)$. At this order, however, we are not concerned
with such complications. Since $m - m_b$ is necessarily of order $g^2$, we can completely ignore
the factor $B(m^2)$ multiplying it in Eq. (1.58). Furthermore, the difference between evaluating
$A(p^2)$ and $B(p^2)$ at $m^2$ and $m_b^2$ is also higher order.$^{52}$ These observations lead to the trivial
relationship

$$ m - m_b = A^{\rm 1\,loop}(m_b^2) + mB^{\rm 1\,loop}(m_b^2) + O(\alpha_s^2) \; , \tag{1.66} $$

We run into some difficulty when we attempt to calculate $Z_F$. One of the derivatives we
wish to take diverges like $\int_0 dx\, x^{-1-\epsilon/2}$. This is not a problem for $\epsilon < 0$, but in order to
regulate the ultraviolet divergences we require $\epsilon > 0$. The origin of this problem lies in the
fact that the quantities $A$ and $B$ are not analytic for $\epsilon$ strictly positive. This divergence comes
from the region of small $k$ in the integral,$^{54}$ so it is of a different kind than those we have
seen. The fact that it appears at all is in fact quite disheartening since it signals nonanalytic

52 This is true if and only if $A(p^2)$ and $B(p^2)$ are analytic functions in a region of momentum space which
includes both $m^2$ and $m_b^2$. We have already assumed that this is the case in defining a 'residue' of the 'pole'
at $p^2 = m^2$ since these concepts do not exist unless our denominator is analytic in this region.

53 Here, I use the fact that the last term in our expression for B(p 2 ) does not contribute at O(t) as p 2 — * m 2 . 
Due to the singularity in the integral, there will be contributions at higher orders in the e expansion. 

54 This can be seen from the fact that the integrand 

1
(1.68)

behavior of $A$ and $B$ near $m_b^2$.$^{55}$ This has the potential to make the theoretical constructs of the
last few pages collapse. Fortunately, there is hope for a solution. The fundamental principle
of dimensional regularization is the assumption that the amplitudes in quantum field theory
are meromorphic functions of spacetime dimension. Obviously, $\Sigma(p)$ is not a meromorphic
function of $\epsilon$ in its present form since we must put restrictions on the sign of $\epsilon$ in order to
make sense of it. However, this form can be interpreted as a representation of the function in
which its meromorphic nature is not manifest.$^{56}$ If we assume that this is the case, we are free
to calculate our divergent contribution in the region $\epsilon < 0$ and continue the result to $\epsilon > 0$ (or
any other part of the complex $\epsilon$ plane). Since this divergence comes from the infrared region of
momentum space, it is sometimes useful to keep it separate from the ultraviolet divergences.
This is done via subscript. Our result for $Z_F$ is



The scheme we have used to define our renormalized parameters above is called the onshell
scheme since it defines renormalized quantities at the point where fields are on their mass
shells, i.e. $p^2 = m^2$. This is not required by the idea of renormalization. Only the external
fields must be normalized properly, and the mass at the pole will be the physical mass
regardless of how we choose to define $m$. This fact is exemplified by the existence of unrenormalized
perturbation theory, where we choose not to renormalize at all. At the end of the day,
we simply eliminate the 'bare' parameters in favor of some arbitrarily defined 'renormalized'
quantities which will be measured in experiment. In two different renormalization schemes the
definitions of the renormalized parameters and the expressions for physical quantities in terms
of these parameters will be different, but the combination is guaranteed to be the same. This
arbitrariness allows us to make the choice which will make our calculations the simplest. The
only residue of the onshell requirements which is left is the normalization of the external fields,
which must be done in the onshell scheme. However, since the fundamental fields of QCD do
not appear as external fields, even this condition is unimportant for our purposes.$^{57}$

so if we differentiate with respect to $p^2$ and evaluate at the pole, we get the expression

This integrand does not cause a problem for large $k$, but can certainly lead to a divergence at small $k$.

55 Strictly speaking, this is not necessarily a problem since we really only need analyticity near the physical
pole $m^2$. However, we are attempting to calculate $m - m_b$ as a power series in $\alpha_s$. Functions which are analytic
at $m$ but not at $m_b$ do not have smooth limits as $\alpha_s \to 0$, hence cannot possess a Taylor expansion about
$\alpha_s = 0$. Since Taylor expansions are the only things we know how to calculate in quantum field theory, we are
foiled again. At this point, we must either admit defeat and go home or find some way to understand these
divergences without compromising the analyticity of $A$ and $B$ near either $m^2$ or $m_b^2$.

56 For example, the function $\sum_{n=0}^{\infty} z^n$ is not a meromorphic function in the complex plane as presented since
it does not converge for $|z| > 1$. However, summing the series in the region where it converges, we find that it
is a perfectly finite function of $z$ for all $z \neq 1$.
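
The analytic continuation used for the infrared divergence can be tried out on two toy cases. The sketch below assumes nothing about the actual self-energy: it uses the geometric series $\sum_n z^n$ as a stand-in for a representation that only converges in part of the plane, and the integral $\int_\delta^1 dx\, x^{-1-\epsilon/2}$ with a hypothetical lower limit `delta`, which settles to $-2/\epsilon$ for $\epsilon < 0$ as `delta` goes to zero. All function names are illustrative.

```python
def geometric_partial_sum(z: complex, terms: int) -> complex:
    """Partial sum of sum_{n>=0} z^n; this representation needs |z| < 1."""
    total, power = 0j, 1 + 0j
    for _ in range(terms):
        total += power
        power *= z
    return total

def geometric_continued(z: complex) -> complex:
    """Closed form 1 / (1 - z), defined for every z != 1."""
    return 1.0 / (1.0 - z)

def ir_integral(eps: float, delta: float) -> float:
    """int_delta^1 dx x^(-1 - eps/2) in closed form, valid for eps != 0."""
    return (1.0 - delta ** (-eps / 2.0)) / (-eps / 2.0)

if __name__ == "__main__":
    print(geometric_partial_sum(0.5, 60), geometric_continued(0.5))  # agree inside |z| < 1
    print(geometric_continued(2.0))  # finite outside the region of convergence
    for delta in (1e-4, 1e-8, 1e-16):
        print(delta, ir_integral(-1.0, delta))  # tends to -2/eps = 2 for eps = -1
```

In both cases the result obtained where the representation converges, $1/(1 - z)$ or $-2/\epsilon$, is then taken as the definition elsewhere.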

One might say that the easiest thing to do is simply not renormalize. The problem with 
this is that all of our parameters are divergent. One cannot measure divergent quantities, so 
in this case one cannot extract the parameters of the theory from experiment. The next best 
thing is to do the least amount of work so that our renormalized parameters are finite. This 
scheme is called Minimal Subtraction (MS). In MS, we define our Z-factors in such a way that 
the counterterms subtract all of the ultraviolet divergences in our diagrams at each order in 
perturbation theory. It must be emphasized here that infrared divergences have nothing to do 
with renormalization. As mentioned above, the bare parameters of our theory are related to 
the way things look at asymptotically large probing energies. Hence they have only to do with 
ultraviolet divergences. Infrared divergences must be handled in an entirely different way, as 
will be discussed at length in the next chapter. For now, we notice that the infrared divergences 
in this calculation occur only when we put the fermion on its mass shell. Away from the point 
$p^2 = m^2$, the amplitudes are infrared finite. This is a general phenomenon - offshell matrix
elements are always infrared finite$^{58}$ since infrared divergences arise only in singular regions
of momentum space. Since we no longer wish to calculate in the onshell scheme, there is no 
reason to consider this singular portion of phase space. 

The counterterms which appear at this order in the quark correlator shift $A(p^2)$ and $B(p^2)$
to

$$ A(p^2) \to A(p^2) + m(Z_mZ_F - 1) \; , $$
$$ B(p^2) \to B(p^2) - (Z_F - 1) \; . $$

57 In QED one must be careful to perform this task before comparing with experiment. Otherwise, inconsistencies
will appear between different experiments. We will also see that this step is necessary for calculating
'physical' matrix elements involving quarks and gluons in the next chapter.

58 This statement is not true for certain gauge choices. In these cases, the infrared divergences come from
singular regions of 'gauge choice space'. An explicit example of this phenomenon is given elsewhere in this chapter.

Looking at our expressions for $A$ and $B$, we see that in the MS scheme

af S C F 2 



(1.71)

where we have explicitly shown the dependence on scheme. The leftover contributions to the
self-energy are straightforward to calculate:

These expressions are somewhat messy. The Euler-Mascheroni constant comes from the
expansion of the $\Gamma$-function and the $\log 4\pi$ comes from the $d$-dimensional phase space factor
$(4\pi)^{d/2}$. A general $\ell$-loop graph will contain $\ell$ powers of the coupling and the phase space, so
divergences will in general be expressed as$^{59}$



A 2 



-ie/2 



A 2 e^ E 
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(1.75) 



where I have used the truncated expansion $\log\Gamma(1 + \epsilon) \simeq -\gamma_E\epsilon$ and the fact that $n\Gamma(n) =
\Gamma(n + 1)$. A is some mass scale in the process, i.e. A ~ m. Since these factors will always
appear with the divergences, it is convenient to include them in the Z-factors. Furthermore,
we are still plagued with the bare scale $\mu_0$. A fully renormalized quantity should not depend on
such an object. Z-factors relate renormalized and unrenormalized quantities, so it is natural

59 Here, I have written the form of the most UV-divergent part of an $\ell$-loop diagram. There will, of course, be
less divergent pieces. The $\gamma_E$'s and $4\pi$'s in these terms will also arrange themselves in a similar way, although
this is more difficult to see.

to absorb the $\mu_0$ dependence into $Z$. Doing this requires the introduction of a scale, which we
choose as $\mu$. Hence our divergence is written



-He/2' 



A 2 e^ 

47Tii 2 



-ie/2 



(1.76) 



The quantity in brackets is associated with the divergence (the transition from bare to renormalized
quantities), and so is seen as a part of $Z$. The leftover piece depends only on physical
parameters and the renormalized scale, and is free to enter renormalized amplitudes.
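
The origin of the constants that travel with the pole can be checked numerically. The sketch below assumes the combination $\Gamma(\epsilon/2)\,(4\pi)^{\epsilon/2}$, which is the form a standard one-loop integral takes in $d = 4 - \epsilon$ dimensions; it is meant as an illustration of the expansion, not as a reproduction of the expressions in the text, and the function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant

def regulated_pole(eps: float) -> float:
    """Gamma(eps/2) * (4 pi)^(eps/2), typical of a one-loop integral in d = 4 - eps."""
    return math.gamma(eps / 2.0) * (4.0 * math.pi) ** (eps / 2.0)

def pole_companion() -> float:
    """Finite constant accompanying the 2/eps pole: -gamma_E + log(4 pi)."""
    return -EULER_GAMMA + math.log(4.0 * math.pi)

if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3):
        # After removing the pole, what is left approaches -gamma_E + log(4 pi).
        print(eps, regulated_pole(eps) - 2.0 / eps, pole_companion())
    # Truncated expansion log Gamma(1 + eps) ~ -gamma_E * eps quoted in the text.
    print(math.lgamma(1.0 + 1e-4), -EULER_GAMMA * 1e-4)
```

Because $-\gamma_E + \log 4\pi$ always appears next to $2/\epsilon$, subtracting the two together costs nothing extra and leaves cleaner finite parts.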

The scheme described above is called Modified Minimal Subtraction ($\overline{\rm MS}$) and will be used
throughout the rest of this dissertation. In $\overline{\rm MS}$, $Z_F$ and $Z_m$ are written

e/2 



V 47T/1q / 



7 MS 



47T 

JmsV ,.■> 



1 a™tf)C F 2(»* 
Air 



-e/2 



(3) + 0(a 2

where we have explicitly shown the dependence of $\alpha_s$ on $\mu^2$. It is obvious that this object will
depend on $\mu^2$ since $Z_g$ depends on $\mu^2$ and $g_b$ does not. All renormalized quantities will depend
on $\mu$. For example, we see that

(1.78)



This means that QCD runs quark masses down as the probing scale is increased. This object 
is in fact not the physical pole mass of the quark (even in the absence of confinement). The 
physical pole mass, as defined in the onshell scheme, must be independent of $\mu$ since it is an
observable. We note here that it is natural for these quantities to depend on scale since the 
scale decides which corrections are to be included in renormalized quantities and which are to 
be included explicitly in the amplitudes. 
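
The direction of the running can be seen in a small numerical sketch. It assumes the standard one-loop $\overline{\rm MS}$ mass anomalous dimension, $\mu^2\, dm/d\mu^2 = -(3C_F\alpha_s/4\pi)\, m$, and, as a simplification not made in the text, holds $\alpha_s$ fixed instead of letting it run. The input values and the function name are illustrative only.

```python
import math

C_F = 4.0 / 3.0  # quark Casimir for SU(3)

def running_mass(m0: float, mu0: float, mu: float, alpha_s: float) -> float:
    """Solve mu^2 dm/dmu^2 = -(3 C_F alpha_s / 4 pi) m with alpha_s held fixed."""
    gamma = 3.0 * C_F * alpha_s / (4.0 * math.pi)
    return m0 * (mu ** 2 / mu0 ** 2) ** (-gamma)

if __name__ == "__main__":
    m0, mu0, alpha_s = 1.0, 1.0, 0.2  # arbitrary units, illustrative coupling
    for mu in (1.0, 10.0, 100.0):
        print(mu, running_mass(m0, mu0, mu, alpha_s))  # decreases as mu grows
```

With a running coupling the power law is replaced by a ratio of couplings, but the sign of the effect is the same.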

The renormalized quantities $A$ and $B$ can also be calculated,

Figure 1.5: The one-loop correction to the ghost self-energy.

and are somewhat cleaner in this scheme. Note the branch cut in our amplitude at $p^2 = m^2$.
This cut leads to the infrared divergence in the onshell scheme. It is related to the possibility
that the intermediate states in our calculation are onshell and free to propagate to infinity (in
the absence of confinement, of course). The condition for this possibility is $(p - k)^2 = m^2$ and
$k^2 = 0$, while $(p - k)^0$ and $k^0$ are positive. Working in the rest frame of $p - k$, one can show
that these conditions imply $p^2 = m^2 + 2mk^0 > m^2$. Having learned our lesson from venturing
into the onshell scheme, we choose to work far away from this kinematical region and take
$p^2 < 0$.
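
The kinematic statement above follows in two lines. In the rest frame of $p - k$ the onshell quark carries momentum $(m, \vec{0})$, and the massless gluon has $k = (k^0, \vec{k})$ with $|\vec{k}| = k^0$, so that $(p - k)\cdot k = m k^0$ and

```latex
\begin{align}
p^2 &= \big[(p - k) + k\big]^2 = (p - k)^2 + 2\,(p - k)\cdot k + k^2 \\
    &= m^2 + 2\,m\,k^0 + 0 \;>\; m^2 \qquad (k^0 > 0) .
\end{align}
```

The cut therefore begins at $p^2 = m^2$, reached as $k^0 \to 0$, which is consistent with the onshell divergence coming from the region of small $k$.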

We still have one point which must be addressed. What should be used for $\mu$? If we wanted
to calculate to all orders, this choice would be arbitrary since no physical quantities actually
depend on $\mu$. However, we have no intention of calculating any physical quantity to all orders
and no hope of doing so in a finite amount of time (at least, not at this stage of the game).
With this in mind, our choice of $\mu$ should be dictated by the size of the coefficient of $\alpha_s$.
For $-p^2 \ll m^2$, the leftover terms in $A$ and $B$ are all small except possibly the $\mu$-dependent
logarithm. Choosing $\mu^2 = m^2$ eliminates this problem. However, for $-p^2 \gg m^2$ this choice
would lead to a term $\sim \log(-p^2/m^2)$ in the amplitude, rendering the perturbative expansion
useless. In this region, it is preferable to choose $\mu^2 = -p^2$. Hence we see that for the one-loop
quark self-energy the natural choice of $\mu^2$ is the larger of $-p^2$ and $m^2$. Of course, we have no
guarantee that these choices will not lead to disaster at higher orders in the expansion. The
best choice of $\mu$ is obviously the one which makes all corrections as small as possible.$^{60}$
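
The rule of thumb for the scale can be written down in a few lines. This is a toy sketch: `leftover_log` is a hypothetical stand-in for the $\mu$-dependent logarithm multiplying $\alpha_s$, modeled here as $\log(\max(-p^2, m^2)/\mu^2)$ rather than taken from the actual expressions for $A$ and $B$.

```python
import math

def natural_scale_sq(p2: float, m2: float) -> float:
    """Rule of thumb from the text: mu^2 = max(-p^2, m^2) for spacelike p^2 < 0."""
    return max(-p2, m2)

def leftover_log(p2: float, m2: float, mu2: float) -> float:
    """Toy model of the logarithm left in the amplitude for a given mu^2."""
    return math.log(max(-p2, m2) / mu2)

if __name__ == "__main__":
    p2, m2 = -1.0e4, 1.0  # deep spacelike momentum, arbitrary units
    print(leftover_log(p2, m2, m2))                        # large log for mu^2 = m^2
    print(leftover_log(p2, m2, natural_scale_sq(p2, m2)))  # vanishes for the natural choice
```

A logarithm of order ten multiplying $\alpha_s$ is what the text means by rendering the expansion useless; the natural choice removes it at this order only.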

The ghost propagator is renormalized in exactly the same way. The relevant diagram (Fig.
1.5) is almost identical to the quark self-energy. We obtain$^{61}$

where $C_A$ is defined in the appendix.

60 In fact, one way to estimate the contributions of higher-order effects is to study the dependence of physical
quantities on $\mu^2$. In regions where the dependence is slight, our corrections should be small.

The three diagrams shown in Fig. 1.6 contribute to the field strength renormalization of 
the gluon. Denoting the sum of the amputated 1PI gluon self-energy insertions as $-i\Pi^{\mu\nu}$, the 
quark loop (Figure 1.6a) contributes^{62} 

The transverse structure $q^2 g^{\mu\nu} - q^\mu q^\nu$ appears as a consequence of current conservation, 
$q_\mu \Pi^{\mu\nu}(q) = 0$. This property of QCD is essential to its renormalizability [ref.]; we will 
see in Appendix D.2 that it persists to all orders. The gluon and ghost loop contributions 
(Figs. 1.6b and c) do not satisfy this condition separately; their contributions have the UV 
divergences 

for the gluons and ghosts, respectively. Note that although each contribution separately 
violates current conservation, the sum does not. This is the reason the ghosts needed to be 
introduced - they exist only to cancel the unphysical gluonic degrees of freedom which appear 
in covariant gauges. 

$\Pi^{\mu\nu}$ receives two counterterm contributions. The field strength contributes 
$-i(Z_A - 1)(q^2 g^{\mu\nu} - q^\mu q^\nu)$, while the gauge parameter gives $i(Z_A Z_\xi^{-1} - 1)\,q^\mu q^\nu/\xi$. Since all of our 

61 From here on, I will drop the superscript $\overline{\mathrm{MS}}$ on $Z$-factors and the like. All results will be given in this 
scheme unless otherwise specified. 

62 The $SU(3)$ generators are normalized by the relation $T_F\,\delta^{ab} = \mathrm{Tr}\,[t^a t^b]$. 

Figure 1.6: The three contributions to the gluon field-strength renormalization at one-loop 
order in covariant QCD. 

divergences are proportional to the transverse structure, we can simply take $Z_A = Z_\xi$ at this 
order. In fact, the counterterm relevant to the gauge parameter leads to nonconservation of 
our current. This is not surprising since this term came from our gauge fixing, so it certainly 
cannot be expected to be gauge invariant. However, it can be shown that even the gauge-fixed 
theory will conserve current [ref.], which implies $Z_\xi = Z_A$ to all orders. We can see this in 
another way by returning to unrenormalized perturbation theory for a moment. Writing the 
full 1PI gluon contribution as 

$$-i\Pi^{\mu\nu}_{ab}(q) = -i\,\Pi(q^2)\left(q^\mu q^\nu - q^2 g^{\mu\nu}\right)\delta_{ab} \qquad (1.85)$$

and re-summing the 1PR contributions, we obtain the full gluon propagator 

$$D^{\mu\nu}_{ab}(q) = \frac{i}{\left(1-\Pi(q^2)\right)q^2}\left[-g^{\mu\nu} + \left(1 - \left(1-\Pi(q^2)\right)\xi\right)\frac{q^\mu q^\nu}{q^2}\right]\delta_{ab} . \qquad (1.86)$$

It is now obvious that the gluon fields and the gauge parameter are renormalized in the same 
way to all orders. This is a direct consequence of our assumption (1.85) that the 1PI gluon 
contribution is transverse. 
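
As a small numerical illustration of why a transverse insertion only dresses the transverse part of the propagator, the sketch below builds the projector $\delta^\mu{}_\nu - q^\mu q_\nu/q^2$, checks that it is idempotent and annihilated by $q_\mu$, and sums the resulting geometric series. The momentum components and the value used for $\Pi(q^2)$ are arbitrary illustrative numbers, not results of the calculation, and the function names are hypothetical.

```python
from fractions import Fraction

METRIC = (1, -1, -1, -1)  # diagonal of g_{mu nu}, signature (+,-,-,-)


def transverse_projector(q):
    """Mixed-index projector P^mu_nu = delta^mu_nu - q^mu q_nu / q^2."""
    q_lower = [METRIC[i] * q[i] for i in range(4)]
    q2 = sum(q[i] * q_lower[i] for i in range(4))
    return [[Fraction(int(i == j)) - q[i] * q_lower[j] / q2 for j in range(4)]
            for i in range(4)]


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def resummed_factor(pi, n_insertions):
    """Partial sum 1 + Pi + ... + Pi^n that multiplies the transverse part."""
    return sum(pi ** k for k in range(n_insertions + 1))


q = [Fraction(3), Fraction(1), Fraction(0), Fraction(2)]  # arbitrary off-shell momentum
P = transverse_projector(q)
q_lower = [METRIC[i] * q[i] for i in range(4)]

# Each insertion is proportional to P, and P * P = P, so the chain of
# insertions collapses to a geometric series in the scalar Pi(q^2).
is_idempotent = matmul(P, P) == P
is_transverse = all(sum(q_lower[i] * P[i][j] for i in range(4)) == 0
                    for j in range(4))

pi_value = Fraction(1, 5)  # illustrative stand-in for Pi(q^2)
partial = resummed_factor(pi_value, 40)
exact = 1 / (1 - pi_value)
print(is_idempotent, is_transverse, float(exact - partial))
```

The longitudinal part of the propagator never meets the projector, which is why the gauge parameter appears in the combination $(1-\Pi)\xi$ once the overall factor $1/(1-\Pi)$ is pulled out.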

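Defining the other counterterm in such a way that it cancels the divergent contributions 
to $\Pi$, we find 

$$Z_A = 1 - \frac{\alpha_s(\mu^2)}{4\pi}\,\frac{2}{\epsilon}\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\left[\frac{4}{3}\,n_f T_F - \frac{1}{2}\left(\frac{13}{3} - \xi\right)C_A\right] , \qquad (1.87)$$

where $n_f$ is the number of flavors. 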



The leftover finite contribution to II has a serious problem coming from the quark loop 
(Figure 1.6a). As in the quark self-energy diagram analyzed above, this expression contains 
logarithms involving the quark masses, the external momentum scale, and the renormalization 
scale. Once again, these logarithms become large unless we choose the renormalization scale 
near the larger of the two physical scales. However, we cannot just arbitrarily choose $\mu^2$ to 
satisfy this one particular diagram; there are two other diagrams which also contribute. These 
other diagrams demand a renormalization scale comparable to the external momentum (q 2 ). 
If any of the quarks in our theory have masses much larger than this scale, Figure 1.6a will 
lead to large finite corrections to the amplitude. 

From a physical point of view, this is nonsense - extremely heavy quarks should decouple 
from low energy phenomena. The problem lies in our renormalization procedure itself. We 
are choosing counterterms to cancel all ultraviolet divergences, regardless of the scale at which 
they become important. This is not exactly what we want. The idea of renormalization 
is to choose a scale and calculate explicitly all physics below that scale, while systematically 
grouping the physics above it into constants which renormalize the fields and parameters of the 
theory. Since the heavy quark loop has support only for loop momenta near or above its mass 
scale, choosing a small renormalization scale should make its contribution to the renormalized 
amplitudes vanish. What we see is quite the contrary: the contribution of the quark loop 
diagram to the renormalized amplitudes is divergent as the quark mass goes to infinity. Since 
this diagram has also contributed to the infinite constant $Z_A$, and the strict limit $m_f \to \infty$ 
gives no contribution to the amplitude,^{63} we can interpret the divergence in the renormalized 
amplitude as an attempt to cancel the divergence in $Z_A$. This procedure leads us to believe that 
we should simply ignore the effects of very heavy quarks; the associated physics will be grouped 
into renormalization constants with the rest of the high-energy behavior of the theory. On 
the other hand, we cannot expect all of the physics associated with a quark to be represented 
by a simple constant. If it could be, QCD would certainly not have such rich structure. The 
complete dynamics associated with a heavy quark must be represented instead by a series of 
local operators which give the heavy quark effects the freedom to depend on external momenta. 
This is the hallmark of an unrenormalizable theory. Expanding the integrand of the quark 
loop contribution in a power series in $1/m_f$, we see that each term is more divergent than 
the last.^{64} This implies that we will need an infinite number of counterterms to cancel them, 
another indication that the effective theory is unrenormalizable. 

63 This can be seen explicitly from the expression for the diagram; one takes $m_f \to \infty$ before doing the 
integral. 

64 Of course, this is because our expansion does not converge for large loop momenta. These new divergences 
did not come out of thin air. 

The explicit form of this theory can be obtained by simply doing the path integral for the 
heavy quark fields. The result, up to a trivial constant factor which cancels with the vacuum 
fluctuations, is 

$$\det\left(1 - \frac{i\slashed{D}}{m_f}\right) . \qquad (1.88)$$

This determinant can be exponentiated via the identity 

$$\log\det M = \mathrm{Tr}\log M , \qquad (1.89)$$

valid for any nonsingular matrix M, and added to the lagrangian of the theory. The expansion 
of the logarithm gives the infinite series of local operators required to encode the effects of the 
heavy quark. 
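
The identity (1.89) and the expansion of the logarithm can be checked on a small matrix. The sketch below is only a finite-dimensional analogue: it takes a hypothetical 2x2 matrix $X$ with small entries, standing in for the operator that is expanded in the text, and compares $\log\det(1-X)$ with the truncated series $-\sum_n \mathrm{Tr}\,X^n/n$.

```python
import math


def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def trace_log_one_minus(x, n_terms=60):
    """Tr log(1 - X) = -sum_{n>=1} Tr(X^n)/n, truncated after n_terms terms."""
    total = 0.0
    power = [row[:] for row in x]
    for n in range(1, n_terms + 1):
        total -= trace(power) / n
        power = matmul(power, x)
    return total


# Illustrative matrix; its eigenvalues lie well inside the unit circle,
# so the series for the logarithm converges quickly.
X = [[0.10, 0.20], [-0.05, 0.15]]
one_minus_x = [[1 - X[0][0], -X[0][1]], [-X[1][0], 1 - X[1][1]]]

log_det = math.log(det2(one_minus_x))
tr_log = trace_log_one_minus(X)
print(log_det, tr_log)
```

Each term $\mathrm{Tr}\,X^n$ of the series plays the role of one operator in the infinite tower described above; in the field theory the trace also runs over space-time, which is where the subtlety discussed next comes from.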

However simple this expansion looks, it is certainly not clear what the trace means in this 
context. For this reason, one usually considers instead the equivalent^{65} determinant 

$$\det\left(1 - (i\slashed{\partial} - m_f)^{-1}\, g\slashed{A}\right) . \qquad (1.90)$$

Here, the trace of a particular term in the expansion of the logarithm is represented as a heavy 
quark loop with the appropriate number of external gluon fields.^{66} The contribution from 
each loop is calculated according to the above rules, with the gluon fields treated as external 
sources. At this point, there is no difference between the present calculation and the one that 
led to (1.82). The difference comes in our interpretation of the ultraviolet divergences.^{67} These 



divergences have the form 

$$\frac{2}{\epsilon}\left(\frac{m_f^2}{\mu^2}\right)^{-\epsilon/2} . \qquad (1.91)$$

This expression represents a battle between two different limits. If $\epsilon\log(m_f/\mu) \ll 1$, we have 
an ultraviolet divergence which is handled as before by renormalizing the gluon field strength. 
On the other hand, if $\epsilon\log(m_f/\mu) \gg 1$, this term approaches zero. Since we cannot take $\epsilon$ 
strictly to zero in expressions of this form, the limit $m_f \to \infty$ annihilates them.^{68} Contributions 
which are ultraviolet finite do not require $\epsilon$ to regulate them. Here, we can take $\epsilon$ strictly to 
zero. Since $m_f$ is not strictly infinity, the factor 

(1.92) 

65 Again, up to a trivial constant. 

66 One simply identifies the factor $(i\slashed{\partial} - m_f)^{-1}$ with the free propagator for the heavy quark. 

67 Loops with five or more gluon fields are ultraviolet finite. These contributions are not as subtle as those 
that presently concern us. 



Grouping the heavy quark effects into a series of local operators is now straightforward. 
We simply calculate the contribution from each nonlocal term in the expansion of (1.90), throw 
the ultraviolet divergence (if any) away, and expand the remainder in a series in $1/m_f$. Each 
term in the expansion is then replaced by an effective operator and added to the lagrangian. 
As an example, we calculate explicitly the two-gluon contribution: 

The final form of the local operator can certainly be expressed in a nicer form. In particular, 
for an abelian theory it is entirely equivalent to the gauge-invariant combination 

For non-abelian theories, it is certainly not clear whether or not the higher order terms in 
our expansion of (1.90) will conspire to form a gauge-invariant combination at the end of the 
day. On one hand, we know that the original lagrangian is gauge invariant so we would expect 



68 One needn't worry about this expression for light quarks. As we have seen above, the terms reorganize 
themselves in this case; $\mu^2$ is compared with $q^2$ rather than $m_f^2$. 

(a) (b) 

Figure 1.7: The two vertex correction diagrams at one-loop order in QCD. 



gauge invariance of our effective lagrangian as well. On the other, the determinant (1.90) is 
not obviously gauge invariant due to our cancelation of the infinite constant 

$$\det\left(i\slashed{\partial} - m_f\right) . \qquad (1.95)$$

In any case, we will consider these correction terms small enough to ignore completely. This 
leads us to the result (1.87) obtained previously for the gluon field strength renormalization, 
but with $n_f$ interpreted as the number of active flavors. 

Now, we need only calculate $Z_g$. This quantity appears in several counterterms. Unless our 
gauge symmetry is broken by quantum effects, the $g$'s in our lagrangian must remain the same 
after renormalization. This can be checked explicitly by calculating $Z_g$ in all four different 
ways. However, a general analysis of the ways a symmetry can be broken by quantum effects 
tells us that vector symmetries (symmetries that treat the left- and right-handed fermions in 
the same way) are preserved.^{69} Trusting this, we calculate the one-loop corrections to the 
quark-gluon vertex shown in Figure 1.7. Since we wish to work in $\overline{\mathrm{MS}}$, only the ultraviolet 
divergent parts of the amplitudes are required to deduce $Z_g$. Diagram 1.7a contains the 
divergence 

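$$\frac{\alpha_s(\mu^2)}{4\pi}\,\frac{2}{\epsilon}\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\xi\left(C_F - \frac{C_A}{2}\right) , \qquad (1.96)$$

while 1.7b contains 

$$\frac{\alpha_s(\mu^2)}{4\pi}\,\frac{2}{\epsilon}\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\frac{3}{4}\left(1 + \xi\right)C_A . \qquad (1.97)$$

Here, we have written only the constant factor multiplying the leading structure, $-igt^a\gamma^\mu$. In 
this language, the counterterm contributes 

$$Z_F Z_A^{1/2} Z_g - 1 . \qquad (1.98)$$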



69 An exception to this rule is conformal symmetry, which is broken explicitly by our regularization procedure. 

Putting all of this together and requiring the sum to be ultraviolet finite gives 

$$Z_g = 1 - \frac{\alpha_s(\mu^2)}{4\pi}\,\frac{2}{\epsilon}\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\left[\frac{11}{6}\,C_A - \frac{2T_F}{3}\,n_f\right] . \qquad (1.99)$$

As before, $n_f$ is the number of quark flavors whose mass is less than the probing scale. We 
notice at once that the gauge dependence has canceled. This is necessary because the gauge 
coupling is independent of gauge choice. It would certainly be strange if renormalization 
required this quantity to acquire gauge dependence. However, the cancelation is highly 
nontrivial. Both of the diagrams, as well as the field strength renormalizations, are gauge-dependent. 
The gauge dependence of these diagrams is not truly an artifact of the coupling itself, but 
rather is associated with the external fields. 



1.5 The Beta Function - Asymptotic Freedom 

As we have seen, the ideas behind renormalization make it necessary for renormalized 
parameters to depend on the scale of separation between moderate and ultraviolet momenta. 
Although true physical observables are in fact independent of this scale, any truncation of the 
coupling expansion will introduce dependence. Furthermore, our effective coupling really does 
depend on scale. This scale dependence governs the region of applicability of perturbation 
theory, as we will see below. 

The variation of the QCD coupling with scale is summarized in the QCD $\beta$-function, defined 
via 

$$\beta(\alpha_s) \equiv \mu^2\,\frac{d\alpha_s}{d\mu^2} . \qquad (1.100)$$

Since this dependence comes only from $Z_g$, the $\beta$-function will not depend on any scales except 
$\mu^2$. Furthermore, the fact that it is dimensionless implies that even this dependence comes 
exclusively from $\alpha_s(\mu^2)$.^{70} Since $g = Z_g^{-1} g_b$ and $g_b$ is independent of $\mu$, we have the relation 

$$\beta(\alpha_s) = -2\alpha_s Z_g^{-1}\,\mu^2\,\frac{dZ_g}{d\mu^2} . \qquad (1.101)$$

The leading order result for $Z_g$ obtained in the last section gives 

$$\beta(\alpha_s) = -\alpha_s^2(\mu^2)\,\frac{1}{2\pi}\left(\frac{11}{6}\,C_A - \frac{2}{3}\,n_f T_F\right) + O(\alpha_s^3) . \qquad (1.102)$$

70 Since the $\beta$-function is manifestly finite when expressed in terms of the renormalized coupling, factors of 
$(\mu^2/\mu_0^2)^{-\epsilon} \to 1$. 

A look at the QCD lagrangian, Eq. (1.5), tells us that the gluon fields are properly normalized 
for $T_F = 1/2$.^{71} Since $C_A = 3$ for $SU(3)$ with this normalization, the leading order $\beta$-function 
is negative in QCD with fewer than seventeen active flavors. This means that if we start with 
a small enough coupling to apply perturbation theory, the coupling will decrease as its scale 
increases. A theory whose $\beta$-function exhibits this behavior is called asymptotically free. 
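
The flavor count quoted above is easy to verify. The sketch below assumes the leading coefficient in its general color-factor form, $4\pi\beta_0 = \frac{11}{3}C_A - \frac{4}{3}n_f T_F$, which reduces to $11 - 2n_f/3$ for $C_A = 3$ and $T_F = 1/2$, and lists the values of $n_f$ for which the leading-order $\beta$-function is negative. The function name is hypothetical.

```python
from fractions import Fraction


def four_pi_beta0(n_f, c_a=Fraction(3), t_f=Fraction(1, 2)):
    """(4 pi) * beta_0 = 11/3 C_A - 4/3 n_f T_F, with SU(3) defaults."""
    return Fraction(11, 3) * c_a - Fraction(4, 3) * n_f * t_f


# The leading-order beta-function is -beta_0 alpha_s^2, so asymptotic
# freedom requires beta_0 > 0.
asymptotically_free = [n_f for n_f in range(0, 20) if four_pi_beta0(n_f) > 0]
print("largest n_f with beta_0 > 0:", max(asymptotically_free))
```

With exact rational arithmetic the sign change falls between sixteen and seventeen flavors, so the six known quark flavors are comfortably on the asymptotically free side.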

The analysis of the last section tells us that the natural choice of $\mu^2$ is the probing scale 
relevant to our process. A different choice will lead to large coefficients of the expansion due 
to leftover logarithms. In this sense, the variation of $\alpha_s$ with scale is a physical phenomenon. 
Asymptotic freedom assures us that QCD behaves as a weakly coupled theory when probed 
at high energy.^{72} On the other hand, our expression for the $\beta$-function also implies that the 
effective coupling grows as we decrease the probing scale. Truncating the $\beta$-function at leading 
order and solving (1.100) gives 

$$\alpha_s(\mu_2^2) = \frac{\alpha_s(\mu_1^2)}{1 + \beta_0\,\alpha_s(\mu_1^2)\log(\mu_2^2/\mu_1^2)} , \qquad (1.103)$$

where I have introduced $\beta_0 = (11 - 2n_f/3)/4\pi$. Were this exact, our coupling would diverge 
at the scale 

$$\mu_2^2 = \mu_1^2\, e^{-1/(\beta_0\alpha_s(\mu_1^2))} . \qquad (1.104)$$

The truncation of the perturbative expansion will lose its validity long before the coupling 
diverges, so this is certainly not a physical effect. However, it does imply that there exists a 
scale small enough that QCD is nonperturbative. Hence we see that asymptotic freedom is a 
two-edged sword - it implies both that perturbation theory will be applicable at high energy 
and that it will not be applicable at low energy. Since low energy is related to long distance 
phenomena, this result means that the interaction between two quarks will grow as they are 
separated until perturbation theory breaks down and their behavior becomes incalculable. One 
can use this as a heuristic motivation for confinement, although it certainly does not constitute 
a proof. 

71 Appropriately normalized real vector fields have the kinetic term $-\frac{1}{2}\,\partial^\mu A^\nu \partial_\mu A_\nu$. 

72 Assuming, of course, that there exists some energy at which the coupling is small enough to apply 
perturbation theory. This assumption has been thoroughly justified by experiment, as exemplified by the observation 
of scaling in DIS. 

Although asymptotic freedom tells us that our coupling will formally vanish as the energy 
is increased, it does not tell us at what energy it will become small. Experimentally, $\alpha_s$ is 
measured at the $Z$-boson mass (91.19 GeV) as 

$$\alpha_s(M_Z^2) = 0.119(2) . \qquad (1.105)$$

While this certainly seems small enough to apply perturbation theory, higher order effects 
should be measurable. For this reason, a systematic treatment of the corrections to the leading 
$\beta$-function is desired. 
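
To get a feeling for the numbers, the leading-order solution of (1.100) can be evaluated starting from the measured value (1.105). The sketch below is a rough illustration only: it holds the number of active flavors fixed at five at every scale (an assumption made here for simplicity; a realistic treatment changes $n_f$ as quark thresholds are crossed), and the function names are hypothetical.

```python
import math

M_Z2 = 91.19 ** 2   # GeV^2, reference scale quoted in the text
ALPHA_MZ = 0.119    # central value of alpha_s at the Z mass


def beta0(n_f):
    """Leading beta-function coefficient, (11 - 2 n_f / 3) / (4 pi)."""
    return (11.0 - 2.0 * n_f / 3.0) / (4.0 * math.pi)


def alpha_s_one_loop(mu2, mu2_ref, alpha_ref, n_f):
    """Leading-order running coupling at mu2, evolved from (mu2_ref, alpha_ref)."""
    denominator = 1.0 + beta0(n_f) * alpha_ref * math.log(mu2 / mu2_ref)
    if denominator <= 0.0:
        raise ValueError("scale lies at or below the leading-order divergence")
    return alpha_ref / denominator


def divergence_scale(mu2_ref, alpha_ref, n_f):
    """Scale (in GeV^2) at which the leading-order denominator vanishes."""
    return mu2_ref * math.exp(-1.0 / (beta0(n_f) * alpha_ref))


for mu in (5.0, 10.0, 91.19, 1000.0):
    print(mu, alpha_s_one_loop(mu ** 2, M_Z2, ALPHA_MZ, n_f=5))
print("divergence at mu^2 =", divergence_scale(M_Z2, ALPHA_MZ, n_f=5), "GeV^2")
```

The coupling grows toward low scales and shrinks toward high ones, as asymptotic freedom requires; the divergence scale printed at the end should be read as a sign that the truncation has failed, not as a physical prediction.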
In general, one writes 

$$\beta(\alpha_s) = -\beta_0\alpha_s^2 - \beta_1\alpha_s^3 - \cdots . \qquad (1.106)$$

Of course, our renormalized coupling will depend on the scheme we use to define it. The 
relationship between the renormalized couplings in two different schemes can be calculated in 
perturbation theory: 

$$\tilde\alpha_s = \alpha_s + \gamma\alpha_s^2 + \delta\alpha_s^3 + \cdots . \qquad (1.107)$$

To see how the $\beta$-function depends on scheme, we write^{73} 

$$\tilde\beta(\tilde\alpha_s) \equiv \mu^2\,\frac{d\tilde\alpha_s}{d\mu^2} = \beta + 2\gamma\alpha_s\beta + 3\delta\alpha_s^2\beta + \cdots . \qquad (1.108)$$

Expanding in terms of $\tilde\alpha_s$ and using the relations 

(1.109) 

we find 

$$\tilde\beta_0 = \beta_0 , \qquad \tilde\beta_1 = \beta_1 , \qquad \tilde\beta_2 = \beta_2 - \gamma\beta_1 - (\gamma^2 - \delta)\beta_0 . \qquad (1.110)$$

73 We assume here that the difference between the two schemes is not scale dependent, i.e. $\gamma$ and $\delta$ do not 
depend on scale. 

Hence the first two coefficients of the $\beta$-function are independent of scheme,^{74} but the rest will 
depend on the conventions used to define $\alpha_s$. 
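
The scheme relations (1.110) can be verified order by order without inverting (1.107). The sketch below is a consistency check under the stated assumption that $\gamma$ and $\delta$ are scale independent: it expands both sides of (1.108) as truncated power series in $\alpha_s$, using arbitrary rational test values for the coefficients, and confirms that they agree through $\alpha_s^4$ when the third coefficient is shifted as in (1.110). All names are hypothetical.

```python
from fractions import Fraction

ORDER = 4  # keep powers of alpha_s up to and including alpha_s^4


def multiply(a, b):
    """Product of two truncated power series (list index = power of alpha_s)."""
    out = [Fraction(0)] * (ORDER + 1)
    for i, a_i in enumerate(a):
        for j, b_j in enumerate(b):
            if i + j <= ORDER:
                out[i + j] += a_i * b_j
    return out


def check_scheme_change(b0, b1, b2, gamma, delta):
    """True if both sides of (1.108) agree through alpha_s^4."""
    # Left side: (1 + 2 gamma a + 3 delta a^2) * beta(a), as a series in a.
    beta = [Fraction(0), Fraction(0), -b0, -b1, -b2]
    jacobian = [Fraction(1), 2 * gamma, 3 * delta, Fraction(0), Fraction(0)]
    lhs = multiply(jacobian, beta)

    # Right side: -b0 t^2 - b1 t^3 - b2_new t^4 with t = a + gamma a^2 + delta a^3,
    # i.e. the new beta-function evaluated at the new coupling.
    b2_new = b2 - gamma * b1 - (gamma ** 2 - delta) * b0
    t = [Fraction(0), Fraction(1), gamma, delta, Fraction(0)]
    t2 = multiply(t, t)
    t3 = multiply(t2, t)
    t4 = multiply(t3, t)
    rhs = [-b0 * t2[k] - b1 * t3[k] - b2_new * t4[k] for k in range(ORDER + 1)]
    return lhs == rhs


ok = check_scheme_change(Fraction(2, 3), Fraction(5, 7), Fraction(-3, 11),
                         Fraction(1, 2), Fraction(-4, 5))
print(ok)
```

Because the comparison is exact, agreement for generic rational inputs is a strong indication that the first two coefficients are untouched while the third shifts by $-\gamma\beta_1 - (\gamma^2 - \delta)\beta_0$.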

Since these higher-order terms depend on scheme, they may be chosen in any way we like. 
A convenient choice defines 

$$\beta_n = \left(\frac{\beta_1}{\beta_0}\right)^n \beta_0 , \qquad (1.111)$$

allowing us to re-sum the $\beta$-function to all orders in $\alpha_s$:^{75} 

$$\beta(\alpha_s) = -\frac{\beta_0\alpha_s^2}{1 - \beta_1\alpha_s/\beta_0} . \qquad (1.112)$$

This function is not Lipschitz continuous near $\alpha_s = \beta_0/\beta_1$, so we have no guarantee that a 
solution to (1.100) exists. In fact, an analysis of this equation reveals two distinct branches of 
solutions. The region $\alpha_s < \beta_0/\beta_1$ is completely disjoint from $\alpha_s > \beta_0/\beta_1$. In the latter case, 
the $\beta$-function is not asymptotically free. Since asymptotic freedom is observed in nature, only 
the former region is physically realized.^{76} However, the solution to Eq. (1.100) in this region, 

(1.113) 

does not exist for arbitrary $\mu^2$. It is readily seen that one must take 

$\mu^2 >$ 

to obtain a sensible value for $\alpha_s(\mu^2)$.^{77} The actual value of this bound on $\mu^2$ depends on 
the number of active flavors and the value of the reference scale,^{78} but its existence implies that we must be 
more careful when attempting to work beyond perturbation theory. Despite appearances, our 
expressions are really only valid in the region of small $\alpha_s$. 

74 This is not true if $\gamma$ and $\delta$ are scale dependent. In that case, only the first coefficient will be scheme 
independent. 

75 It may seem as though we have just obtained something for nothing. However, this manipulation is not a 
true simplification; it only plays one on TV. If we wish to take this scheme seriously, we must use it consistently 
in all of our calculations. One way to do this is to calculate an amplitude in a known scheme, say $\overline{\mathrm{MS}}$, and 
re-express the amplitude in terms of the new coupling via (1.107). The constants $\gamma$ and $\delta$ are obtained from 
the requirement (1.111) in conjunction with (1.110). Even after all of this manipulation, our amplitude will 
still be correct only up to the order at which it was calculated. On the other hand, this scheme is certainly not 
without its merits. The simple evolution of $\alpha_s$ it offers may indeed be worth the effort. 

76 Although the latter region is not physical in the sense that it is not asymptotically free, it certainly does 
exhibit the kind of behavior we expect for a theory with well-defined infrared behavior. Unless our coupling 
really does diverge in the infrared, the $\beta$-function must become positive as $\alpha_s$ becomes large. This, along with 
the physical requirement of Lipschitz continuity, implies the existence of an infrared fixed point. These topics 
are discussed to some extent in [ref.]. 

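One can obtain $\beta_1$ from a two-loop calculation of any vertex in the theory. Such a 
calculation is extremely complicated, but certain general aspects of it can give us insight into the 
behavior of higher order effects. To begin with, assume 

$$Z_g = 1 + \sum_{i=1}^{\infty} z_i\left[\frac{\alpha_s(\mu^2)}{4\pi}\right]^i\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-i\epsilon/2} . \qquad (1.115)$$

We emphasize here that it is the renormalized coupling which appears in this expansion. 
Keeping all terms up to order $\alpha_s^3$, we write 

$$\beta(\alpha_s) = \alpha_s\left\{ z_1\epsilon\,\frac{\alpha_s}{4\pi}\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2} + \left(2z_2 - 3z_1^2\right)\epsilon\left[\frac{\alpha_s}{4\pi}\right]^2\left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-2\epsilon/2}\right\} . \qquad (1.116)$$

Evidently, the coefficient we are after is 

$$\beta_1 = \frac{3z_1^2 - 2z_2}{(4\pi)^2}\,\epsilon . \qquad (1.117)$$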



Since $z_1$ is divergent like $1/\epsilon$, it looks as though the second coefficient of the $\beta$-function 
diverges in the same manner. This is unacceptable. The renormalizability of our theory 
guarantees a finite renormalized coupling. If $g$ is finite, it does not have divergent variation 
with $\mu^2$. The only way out of this situation is to require a complete cancelation of $z_1^2$, i.e. 
if $z_2 = a/\epsilon^2 + b/\epsilon$ then $a = 3(\epsilon z_1)^2/2$. This is in fact what happens. The fact that $g$ 
is finite implies that the double poles in $z_2$ are not independent of $z_1$. This behavior is 
expected since double poles come from regions of momentum space where two loop-momenta 
are simultaneously large. Renormalizability assures us that all divergences obtained when one 
loop momentum gets large are taken care of by the one-loop counterterms, so these double 
poles are naturally related to products of the one-loop results. This is a general aspect of 
renormalizable quantum field theories - all higher-order ultraviolet divergences in an $\ell$-loop 
diagram are related to the divergences in $n < \ell$-loop diagrams. Only the $1/\epsilon$ pole in a diagram 
gives new information. Stated compactly, all true $\ell$-loop effects are given by the residue of 
the $\ell$-loop amplitude at $\epsilon = 0$. This is a nice result, but difficult to exploit in practice. One 
usually needs to compute the entire divergence structure of a graph to extract the $1/\epsilon$ pole. 
However, it can be an extremely useful check of a calculation. 

77 Qualitatively, one can identify the two disjoint regions at the point $\alpha_s = \beta_0/\beta_1$. The phase plane portrait 
of (1.100) then reveals this value as a special kind of infrared fixed point. This analysis does not survive the 
transition to quantitative reasoning, however, since increasing the scale then gives ambiguous results. 

78 A standard treatment from the $Z$ mass, changing $n_f$ as quark masses are crossed, gives a lower bound of 
approximately 0.4 GeV$^2$. 
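
The pole cancelation described above can be made concrete with exact arithmetic. The sketch below evaluates the combination $\epsilon(3z_1^2 - 2z_2)$ from (1.117) for $z_1 = c/\epsilon$ and $z_2 = a/\epsilon^2 + b/\epsilon$; the numbers chosen for $b$ and $c$ are arbitrary placeholders, not the actual QCD coefficients, and the function name is hypothetical.

```python
from fractions import Fraction


def sixteen_pi2_beta1(eps, c, a, b):
    """eps * (3 z1^2 - 2 z2) with z1 = c/eps and z2 = a/eps^2 + b/eps."""
    z1 = c / eps
    z2 = a / eps ** 2 + b / eps
    return eps * (3 * z1 ** 2 - 2 * z2)


c = Fraction(-23, 3)           # placeholder one-loop pole coefficient
b = Fraction(5, 7)             # placeholder single-pole coefficient at two loops
a_consistent = 3 * c ** 2 / 2  # double pole fixed by the one-loop result

for eps in (Fraction(1, 10), Fraction(1, 1000)):
    finite = sixteen_pi2_beta1(eps, c, a_consistent, b)
    spoiled = sixteen_pi2_beta1(eps, c, a_consistent + 1, b)
    print(eps, finite, spoiled)
```

With the double pole tied to the one-loop coefficient the result is $-2b$ for every $\epsilon$, so only the single pole of $z_2$ carries new two-loop information; any other value of $a$ leaves a piece that grows like $1/\epsilon$.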
The result for $\beta_1$ can be found in the literature [ref.]. It is 

$$\beta_1 = \frac{1}{24\pi^2}\left[17C_A^2 - 2n_f T_F\left(3C_F + 5C_A\right)\right] = \frac{1}{(4\pi)^2}\left(102 - \frac{38}{3}\,n_f\right) . \qquad (1.118)$$

Since $\beta_1 > 0$ for $n_f < 9$, this coefficient only serves to hasten the asymptotic vanishing of the 
coupling. 
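
The two forms of the two-loop coefficient can be cross-checked with exact arithmetic. The sketch below assumes the standard $SU(3)$ values $C_A = 3$, $C_F = 4/3$ and $T_F = 1/2$ (the value of $C_F$ is supplied here as an assumption, since it is not restated in this section) and works with $(4\pi)^2\beta_1$ to avoid factors of $\pi$. The function name is hypothetical.

```python
from fractions import Fraction


def sixteen_pi2_beta1(n_f, c_a=Fraction(3), c_f=Fraction(4, 3), t_f=Fraction(1, 2)):
    """(4 pi)^2 beta_1 = (2/3) [17 C_A^2 - 2 n_f T_F (3 C_F + 5 C_A)]."""
    return Fraction(2, 3) * (17 * c_a ** 2 - 2 * n_f * t_f * (3 * c_f + 5 * c_a))


# Compare with the numerical form 102 - 38 n_f / 3 and find where the sign flips.
matches = all(sixteen_pi2_beta1(n) == 102 - Fraction(38, 3) * n for n in range(17))
largest_positive = max(n for n in range(17) if sixteen_pi2_beta1(n) > 0)
print(matches, largest_positive)

# Abelian limit C_A = 0, T_F = C_F = 1: the coefficient is negative, so this
# term adds a positive contribution to the beta-function.
print(sixteen_pi2_beta1(1, c_a=Fraction(0), c_f=Fraction(1), t_f=Fraction(1)))
```

The prefactor $2/3$ is just $(4\pi)^2/24\pi^2$, which is why the color-factor bracket and the numerical form agree for every flavor number.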

Looking at the diagrams we used to arrive at this result, we see that the $\beta$-function for 
QED can be obtained by taking $C_A = 0$ and $T_F = C_F = 1$. In this theory, the $\beta$-function 
is positive in perturbation theory. We can understand this in terms of screening of electric 
charge. A charge sitting in the vacuum polarizes the virtual fluctuations. Since QED causes 
opposite charges to attract, this polarization results in a screening of the true electric charge. 
As we probe the charge at higher and higher energy, we effectively penetrate some of this 
shielding and get a look at the true charge. Hence, we would expect the QED coupling to 
increase with scale. The analogue of this argument in QCD results in anti-screening since 
the gluons themselves participate in the vacuum polarization. The effect of gluons is opposite 
that of quarks, so it acts to increase the charge experienced at great distances. Although this 
argument and its extensions cannot rigorously prove that colored objects are confined, they 
can certainly go a long way towards giving us a qualitative understanding of this phenomenon. 

1.6 Renormalization in the Axial Gauge - 
The Price of a Ghost-Free Theory 

Now that we understand the need for renormalization in quantum field theories, we would 
like to see how it plays out in another gauge. As introduced in Section 1.2, the axial gauge 
seems rather simple. There are no ghosts to worry about and the freedom to choose $n$ can in 
principle allow us to simplify calculations a great deal. The only apparent complication is the 
rather lengthy gluon propagator, Eq. (1.47). 

This gauge belongs to a set of 'physical gauges' since contraction of the gauge propagator 
with its momentum gives 

$$q_\mu D^{\mu\nu}(q) = i\left[\frac{n^\nu}{n\cdot q} - \frac{\left(n^2 + \xi q^2\right) q^\nu}{(n\cdot q)^2}\right] , \qquad (1.119)$$

which does not contain a propagator pole at $q^2 = 0$.^{79} Hence the unphysical degree of freedom 
associated with the longitudinal polarization state does not propagate in these gauges. This 
is the reason there are no ghosts in axial gauges - the degree of freedom they live to cancel does 
not propagate here. From this reasoning, it seems that axial gauge, rather than covariant, is 
the natural choice for gauge theories. 

Once we begin to look at renormalization, however, this gauge changes its tune. The new 
tune is one of violent attacks on Lorentz invariance and unregulated divergences. To study 
these effects, let us consider a simpler situation than the general case. Apparently, taking 
$n^2 = \xi = 0$ simplifies the gluon propagator immensely. Since we intend to choose $n$ as a 
special vector in our problem, we take it collinear to all external momenta in our problem. 
Note that this implies that our use of axial gauge requires collinear kinematics.^{80} This space 
of collinear vectors is spanned by the basis $\{n^\mu = (1, 0, 0, -1)/2\Lambda,\ p^\mu = (1, 0, 0, 1)\Lambda\}$, where 
$\Lambda$ is an arbitrary scale reflecting the residual boost invariance along the 3-direction.^{81} Our 
collinear kinematics requirement is that all external momenta can be expressed as a linear 
combination of $n$ and $p$, i.e. all particles move in the 3-direction.^{82} Note that we have chosen 
our basis vectors such that $p^2 = n^2 = 0$ and $n\cdot p = 1$. 

79 This is not the case in covariant gauges. There, contraction gives $-i\xi q^\nu/q^2$. Hence in covariant gauges 
unphysical degrees of freedom propagate unless one takes $\xi = 0$ (Landau's gauge). 

80 One can, of course, use the axial gauge in situations which are not collinear. However, the integrals one is 
faced with in these situations are a great deal more complicated than those we will encounter. The choice of 
axial gauge is usually made expressly for the reason that the kinematics of a process favor one direction over 
all others. Otherwise, there is no advantage to its use. 

81 This invariance is really an illusion since any specific choice for $n$ fixes $\Lambda$. 
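
The normalization of the light-cone basis is easy to confirm explicitly. The sketch below assumes the metric signature $(+,-,-,-)$ and an arbitrary rational value for the scale $\Lambda$; it checks $n^2 = p^2 = 0$ and $n\cdot p = 1$, splits a generic momentum $k$ into $(n\cdot k)\,p + (p\cdot k)\,n + k_\perp$, and evaluates the Jacobian of the change of variables from $(k^0, k^3)$ to $(n\cdot k, p\cdot k)$. The function names are hypothetical.

```python
from fractions import Fraction


def dot(a, b):
    """Minkowski product with signature (+,-,-,-)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def light_cone_basis(lam):
    """Basis vectors n = (1,0,0,-1)/(2 lam) and p = (1,0,0,1) lam."""
    n = [Fraction(1) / (2 * lam), 0, 0, Fraction(-1) / (2 * lam)]
    p = [lam, 0, 0, lam]
    return n, p


def decompose(k, n, p):
    """Split k into (n.k) p + (p.k) n + k_perp and return the three pieces."""
    nk, pk = dot(n, k), dot(p, k)
    k_perp = [k[i] - nk * p[i] - pk * n[i] for i in range(4)]
    return nk, pk, k_perp


def light_cone_jacobian(lam):
    """Determinant of d(n.k, p.k)/d(k^0, k^3) for the basis above."""
    dnk_dk0, dnk_dk3 = Fraction(1) / (2 * lam), Fraction(1) / (2 * lam)
    dpk_dk0, dpk_dk3 = lam, -lam
    return dnk_dk0 * dpk_dk3 - dnk_dk3 * dpk_dk0


lam = Fraction(7, 3)  # arbitrary illustrative value of the scale
n, p = light_cone_basis(lam)
k = [Fraction(5), Fraction(1, 2), Fraction(-2), Fraction(3)]  # generic momentum
nk, pk, k_perp = decompose(k, n, p)

print(dot(n, n), dot(p, p), dot(n, p))
print(dot(k, k) == 2 * nk * pk + dot(k_perp, k_perp))
print(light_cone_jacobian(lam))
```

The Jacobian has unit magnitude for any $\Lambda$, so trading $(k^0, k^3)$ for $(n\cdot k, p\cdot k)$ in a loop integral costs no extra factor, and $k^2 = 2(n\cdot k)(p\cdot k) + k_\perp^2$ with $k_\perp^2 \le 0$ in this signature.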

The goal of this section is to obtain the QCD $\beta$-function in the axial gauge. Along the 
way, we will see some major differences between this gauge choice and the covariant choice. As 
with the covariant gauges, our first task is to calculate the quark self-energy. For simplicity, 
we begin with massless quarks.^{83} On general grounds, we can decompose the 1PI contribution^{84} 

$$\Sigma(q) = A\,\slashed{q} + B\,\frac{q^2}{2n\cdot q}\,\slashed{n} . \qquad (1.120)$$

The appearance of $\slashed{n}$ is a direct consequence of the explicit Lorentz symmetry breaking induced 
by our axial gauge choice. This also leads to the fact that $A$ and $B$ are functions of $q^2$ and 
$n\cdot q$. Our choice $n^2 = 0$ suppresses their dependence on $n^2$. The factor $q^2/2n\cdot q$ multiplying 
$B$ causes $A$ and $B$ to have the same dimensionality (since $n$ is like an inverse mass) and will 
prove convenient in the future. 



Iterating the 1PI result to all orders to obtain the full 1PR propagator as in (1.55) gives 

$$i\left[\left(1 - A\right)\slashed{q} - B\,\frac{q^2}{2n\cdot q}\,\slashed{n}\right]^{-1} . \qquad (1.121)$$

One look at this expression tells us that things are not as simple as they were in the covariant 
gauges. We cannot renormalize the quark fields with a simple constant. The form of this full 
propagator implies that our renormalized fields must be related to the unrenormalized ones 
through a rotation in Dirac space; there is no other way to remove the $\slashed{n}$. Rewriting (1.121) 
as 

$$i\left\{\left(1 - A\right)\left[1 + \frac{\slashed{n}}{2n\cdot q}\left(\sqrt{1 - B(1-A)^{-1}} - 1\right)\slashed{q}\right]\slashed{q}\left[1 + \frac{\slashed{q}}{2n\cdot q}\left(\sqrt{1 - B(1-A)^{-1}} - 1\right)\slashed{n}\right]\right\}^{-1} , \qquad (1.122)$$

it is obvious that the renormalized fields are defined via 

$$\psi = \sqrt{1 - A}\left[1 + \frac{\slashed{q}}{2n\cdot q}\left(\sqrt{1 - B(1-A)^{-1}} - 1\right)\slashed{n}\right]\psi_b . \qquad (1.123)$$

This transformation is certainly not a constant multiplicative factor. Its nontrivial dependence 
on $n\cdot q$ implies that the manipulations used in Section 1.4 to define renormalized perturbation 
theory will lead to an infinite series of local operators, each of which renormalizes in its own 
special way. Hence our theory appears unrenormalizable. 

82 This is not as restrictive as it may seem. In two-particle systems, we can always transform into a frame in 
which the momenta are collinear. In this frame, we are free to quantize the theory and introduce $n$. However, 
once we have introduced $n$ we are no longer free to change frames. This is a consequence of the axial gauges' 
violence to Lorentz symmetry. 

83 Since we are concerned here only with the ultraviolet behavior of the theory, this can be done without loss 
of generality. We will briefly consider massive quarks at the end of this section. 

84 We take the quark momentum as $q$ here to avoid confusion with the basis vector $p$. 
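
The Dirac-space factorization behind this statement can be checked numerically. The sketch below is an independent check, not the author's derivation: using explicit gamma matrices in the Dirac representation, it verifies that $(1-A)\,\bar{W}\slashed{q}\,W = (1-A)\slashed{q} - B\,\frac{q^2}{2n\cdot q}\slashed{n}$ with $W = 1 + c\,\slashed{q}\slashed{n}/(2n\cdot q)$, $\bar{W} = 1 + c\,\slashed{n}\slashed{q}/(2n\cdot q)$ and $c = \sqrt{1 - B/(1-A)} - 1$. The values of $A$, $B$, $\Lambda$ and $q$ are arbitrary test numbers, and the helper names are hypothetical.

```python
import math

# Gamma matrices in the Dirac representation.
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G1 = [[0, 0, 0, 1], [0, 0, 1, 0], [0, -1, 0, 0], [-1, 0, 0, 0]]
G2 = [[0, 0, 0, -1j], [0, 0, 1j, 0], [0, 1j, 0, 0], [-1j, 0, 0, 0]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
GAMMAS = (G0, G1, G2, G3)
SIGNS = (1, -1, -1, -1)  # metric signature (+,-,-,-)
IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def combine(a, b, coeff):
    """Return the matrix a + coeff * b."""
    return [[a[i][j] + coeff * b[i][j] for j in range(4)] for i in range(4)]


def scale(a, c):
    return [[c * a[i][j] for j in range(4)] for i in range(4)]


def slash(v):
    """Contract a contravariant four-vector with the gamma matrices."""
    return [[sum(SIGNS[m] * v[m] * GAMMAS[m][i][j] for m in range(4))
             for j in range(4)] for i in range(4)]


def dot(a, b):
    return sum(SIGNS[m] * a[m] * b[m] for m in range(4))


LAM = 1.5                                   # arbitrary scale in n
n = [1 / (2 * LAM), 0.0, 0.0, -1 / (2 * LAM)]  # light-like gauge vector
q = [2.0, 0.3, -0.4, 0.5]                   # generic off-shell momentum
A, B = 0.1, 0.2                             # arbitrary test values
c = math.sqrt(1 - B / (1 - A)) - 1

q_slash, n_slash = slash(q), slash(n)
nq, q2 = dot(n, q), dot(q, q)
W = combine(IDENTITY, matmul(q_slash, n_slash), c / (2 * nq))
W_bar = combine(IDENTITY, matmul(n_slash, q_slash), c / (2 * nq))

factorized = scale(matmul(matmul(W_bar, q_slash), W), 1 - A)
direct = combine(scale(q_slash, 1 - A), n_slash, -B * q2 / (2 * nq))
max_diff = max(abs(factorized[i][j] - direct[i][j])
               for i in range(4) for j in range(4))
print(max_diff)
```

The identity uses only $\slashed{q}\slashed{q} = q^2$ and $n^2 = 0$, so it holds for any $q$; the momentum dependence of $W$ through $\slashed{q}/(2n\cdot q)$ is what prevents the field redefinition from being a constant.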

What happened? In covariant gauges, QCD is a perfectly reasonable renormalizable field 
theory. Since the gauge choice we make is in principle arbitrary, there should be no real 
difference between the two choices; we certainly should not have the situation that QCD is 
renormalizable with one choice but not the other! However, we must be much more careful 
when discussing renormalizability. Taken by itself with no gauge-fixing and no regularization 
procedure, QCD is a nonsensical field theory. It is plagued by infinities of many different 
kinds and presents no way for us to extract sensible predictions from its amplitudes. Only 
after we decide on a regularization and gauge-fixing procedure will QCD mean something. For 
this reason, we certainly cannot expect QCD to be renormalizable for any and every choice of 
ultraviolet cut-off and gauge-fixing function. Renormalizability is as much a property of these 
choices as it is of the lagrangian. As mentioned above, QCD is not renormalizable even in 
covariant gauges if the ultraviolet cut-off breaks Lorentz invariance. In the present case, this 
symmetry is broken explicitly by the gauge-fixing procedure. However, since here it is only the 
gauge choice that breaks the symmetry, we can handle these difficulties and define a sensible 
quantum field theory in the axial gauge.^{85} 

To see how this is done, recall that we are not forced to use renormalized perturbation 



theory. In Section 1.4 we saw that we could also use unrenormalized perturbation theory. The 
results are the same as long as we express the final result in terms of renormalized quantities. 
In particular, all external fields in the amplitude must be renormalized. Writing the full quark 
propagator as 

$$V^{-1}\,\frac{i}{\slashed{q}}\,\gamma^0\left(V^{-1}\right)^\dagger\gamma^0 , \qquad (1.124)$$

with 

$$V = \sqrt{1 - A}\left[1 + \frac{\slashed{q}}{2n\cdot q}\left(\sqrt{1 - B[1-A]^{-1}} - 1\right)\slashed{n}\right] , \qquad (1.125)$$

we see that a properly normalized amputated Green's function is obtained by simply removing 
$V^{-1}\,i/\slashed{q}$ from the full propagator of all outgoing quark lines and $(i/\slashed{q})\,\gamma^0(V^{-1})^\dagger\gamma^0$ from the 
full propagator of all incoming quark lines. All remaining divergences^{86} are attributed to the 
operators themselves. 

85 This is valid as long as we choose an ultraviolet regulator which does not further mutilate either Lorentz 
symmetry or gauge invariance. We will employ dimensional regularization for this purpose. 

Now that we understand the basic idea behind renormalization in the axial gauge, let us 
calculate the all-important quantity V . Since A and B start at 0(a s ), a one-loop calculation 
sees only their leading powers in V : 

V~(l- 1 -A--^-Bii) . (1.126) 



2 An - q 

The calculation of $A$ and $B$ is complicated by the presence of $1/k\cdot n$ in the gluon propagator.
This term renders the integration techniques of Appendix A.3 impotent since it is not quadratic
in loop momenta.^87 Alternatively, using our special collinear kinematics, we can decompose
the $d$-dimensional integration as^88

$$ d^dk = d(n\cdot k)\, d(p\cdot k)\, d^{d-2}k_\perp . \tag{1.128} $$

Since the denominator contains two powers of $p\cdot k$, this part of the integration may be done
by contour^89 if no powers of $p\cdot k$ appear in the numerator. Powers that do appear may be



86 After a similar manipulation is performed for all external gluon lines, of course.
87 A variation of the Feynman parameter approach in Appendix A.3, based on the fact that

$$ \frac{1}{AB} = \int_0^\infty \frac{dz}{[A + zB]^2} , \tag{1.127} $$

can be employed in this situation. Obviously, generalizations of this relation can be used to combine any
number of normal propagators and 'light-cone' denominators $1/k\cdot n$. However, the introduction of a new
Feynman parameter here leads to complications we would rather avoid. The techniques introduced below are
much better suited for our purpose.
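
The parameter identity quoted in this footnote can be checked numerically. The sketch below is only an illustration: the function name and the sample values of A and B are assumptions, not taken from the text. It maps z in [0, infinity) onto t in [0, 1) through z = t/(1 - t), which turns the integrand into the ordinary Feynman-parameter form 1/[A(1 - t) + Bt]^2, and compares a midpoint-rule estimate with 1/(AB).

```python
def light_cone_parameter_integral(a: float, b: float, steps: int = 100_000) -> float:
    """Midpoint-rule estimate of the integral of dz / (a + z*b)**2 over z in [0, inf).

    The substitution z = t / (1 - t) maps the range onto t in [0, 1) and gives
    the integrand 1 / (a*(1 - t) + b*t)**2, which is smooth for a, b > 0.
    """
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h  # midpoint of the i-th sub-interval
        total += h / (a * (1.0 - t) + b * t) ** 2
    return total


# Hypothetical positive test values; any a, b > 0 should work.
a, b = 2.0, 0.5
print(light_cone_parameter_integral(a, b), 1.0 / (a * b))
```

The check only covers positive A and B; for the propagator denominators that occur in practice the identity has to be understood together with the appropriate pole prescription.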

88 Our choice of p and n gives no change in measure. 



89 The application of contour integration in this context is spelled out more clearly in Sections 2.2 and 3.3 
removed via the replacement

$$ 2\, p\cdot k = \frac{n\cdot(q-k)}{(n\cdot q)^2}\, q^2 , \tag{1.129} $$

which is valid in the numerator.^90 In this manner, we obtain the result
UsCf t (£\ { -q 2 \~ e/2 [ x dy [y f _ i/M^/J 



2n \2J \47t^q/ J q x 

where $x = n\cdot q$. The ultraviolet divergence is contained in the factor $\Gamma(\epsilon/2)$ multiplying our
expression.
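
The numerator replacement for $2\,p\cdot k$ can be traced in a few lines. The following is a sketch under stated assumptions rather than the author's own derivation: it assumes a light-cone basis with $p^2 = n^2 = 0$ and $p\cdot n = 1$, and an external momentum $q$ with no transverse component. Terms proportional to $k^2$ or $(k-q)^2$ cancel a propagator and are dropped because the remaining integral has no scale.

```latex
\begin{align*}
k^\mu &= (n\cdot k)\,p^\mu + (p\cdot k)\,n^\mu + k_\perp^\mu ,
\qquad
q^\mu = (n\cdot q)\,p^\mu + \frac{q^2}{2\,n\cdot q}\,n^\mu , \\
2\,k\cdot q &= 2\,(n\cdot q)(p\cdot k) + \frac{q^2}{n\cdot q}\,(n\cdot k)
             = q^2 + k^2 - (k-q)^2 , \\
2\,p\cdot k &= \frac{n\cdot(q-k)}{(n\cdot q)^2}\,q^2
             + \frac{k^2 - (k-q)^2}{n\cdot q}
\;\longrightarrow\;
\frac{n\cdot(q-k)}{(n\cdot q)^2}\,q^2 .
\end{align*}
```

The last step is where the absence of a transverse component in $q$ matters: with general kinematics an extra $k_\perp\cdot q_\perp$ term would survive.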



In analogy with Section 1.4, we would like to include only the ultraviolet divergent part of
$\Sigma$ in the definition of $V$. This procedure leads to an extra divergence of the form^91

r (i.i32) 

Jo x x-y 

in the second term. Tracing it back through the calculation, we see that our new divergence
comes directly from the factor $1/k\cdot n$ in the gluon propagator. This behavior is reminiscent
of the divergence we encountered in Section 1.4 when we were working in the onshell scheme.
There, the divergence appeared because of our choice of external momenta. Once we decided
that we did not wish to renormalize in the onshell scheme, it no longer posed a problem
for us. The present divergence, on the other hand, appears regardless of which external states we
choose. Deforming $n^2$ away from zero, we see that instead of a divergence we obtain terms
like $\log^2(n^2 q^2/(q\cdot n)^2)$ in the finite part of our amplitude. The part of our amplitude that
generates these light-cone singularities does not even contribute to the ultraviolet divergence
for $n^2$ finite.



90 In deriving this result, I have used

d(n ■ k)d(p ■ k)f(n -q,n-k) ^ = / d{n ■ k)d(p ■ fc)/(n ■ q,n ■ k) (fc g)2 = (1-130) 

by absence of scale. This is true regardless of the form of $f$ since both $n\cdot q$ and $n\cdot k$ are dimensionless. The
validity of this replacement also rests on the fact that q has no transverse component. This is one place where 
complications due to general kinematics rear their ugly heads. 

91 For finite $\epsilon$, this integral is quite finite. Looking at Eq. (1.131), we see that the divergence appears as
$\Gamma(-\epsilon/2)$. However, consistency requires that only ultraviolet divergences be included in $V$. A well-defined
separation between infrared and ultraviolet physics can be obtained only if we use different techniques to 
regularize the two types of divergences. 
To gain insight into the meaning of these divergences, it is necessary to take a closer look
at the calculation. A characteristic integral that leads to divergences of this sort is

$$ \int \frac{d^dk}{(2\pi)^d}\, \frac{1}{(n\cdot k - \omega)\, k^2\, (q-k)^2} . \tag{1.133} $$

A simple power-counting argument implies immediately that this integral is not ultraviolet
divergent. However, performing the $k^-$ integral by contour, we immediately arrive at the
opposite conclusion. This fact is extremely disconcerting. Renormalization proofs rely heavily
on power-counting, so the ultraviolet divergence contained in this one integral has the power
to destroy the renormalizability of our theory. Actually performing this integral away from
the singular points in the complex plane of $\omega$ reveals that it not only diverges, but produces
a divergence which is dependent on logarithms of the external momenta. Since $\log(x)$ is not
analytic at $x = 0$, these divergences cannot even be absorbed by local operators. If we
intend to take this theory seriously, our lagrangian requires counterterm operators of the form
$\bar\psi\, \log(i n\cdot\partial)\, \psi$. Obviously, this is quite unacceptable.



In order to understand why the integral (1.133) does not follow the rules of power-counting,
we must remember how those rules are derived. Since the square of a Euclidean vector is
positive-definite, power-counting arguments are always valid in Euclidean space. In order to
use Euclidean arguments on a Minkowski integral, our integrand must allow us to perform
a rotation through $\pi/2$ radians in the complex energy plane. The causal prescription for the
physical propagator poles introduced earlier satisfies this requirement.^92 We have not
explicitly chosen a prescription for the pole in the gluon propagator, but our manipulations
imply that our prescription does not depend on any of the other coordinates. Since $n\cdot k$ is
linear in $k^0$, our poles occur either always above or always below the real axis. No prescription of
this form admits Wick rotations, so we cannot apply power-counting techniques.



As mentioned in Section 1.2, there is a causal prescription for our new poles. Since we intend
to perform our integrals in light-cone coordinates, we prefer the Mandelstam-Leibbrandt (ML)
prescription |3j| 

$$ \frac{1}{n\cdot k} \to \frac{p\cdot k}{(p\cdot k)(n\cdot k) + i\varepsilon} = \frac{2\, p\cdot k}{k^2 + k_\perp^2 + i\varepsilon} \tag{1.134} $$



92 This prescription forces negative-energy poles to be above the real axis and positive-energy poles to be 
below. Hence the axis can be rotated freely in the ultraviolet region without encountering any poles. 
over the one involving $k\cdot n$. The form of this prescription makes our earlier mathematical
indiscretion apparent. We were considering contour integration in the $k^-$ plane, while the
poles in the $k^+$ plane are the ones that truly dictate the analytic behavior of our integral. The
fact that we ignored the analytic structure of this pole before induced the non-causal principal
value (PV) prescription^93

$$ \frac{1}{n\cdot k} \to \lim_{\delta\to 0} \frac{1}{2} \left( \frac{1}{n\cdot k + i\delta} + \frac{1}{n\cdot k - i\delta} \right) = \lim_{\delta\to 0} \frac{n\cdot k}{(n\cdot k)^2 + \delta^2} . \tag{1.135} $$

While this prescription certainly regularizes the light-cone divergence, it is not at all obvious 
that it will lead to a renormalizable quantum field theory. 
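
To make the action of the principal value kernel concrete, here is a small numerical sketch. The test function exp(x), the integration range, and all function names are illustrative assumptions rather than anything used in the text. It shows that the regularized kernel x/(x^2 + delta^2) gives a finite answer when integrated symmetrically through the singularity, while a one-sided integral that starts at the singularity still grows like log(1/delta).

```python
import math


def pv_kernel_integral(delta: float, steps: int = 200_000) -> float:
    """Midpoint estimate of the integral over x in [-1, 1] of exp(x) * x / (x**2 + delta**2)."""
    h = 2.0 / steps
    total = 0.0
    for i in range(steps):
        x = -1.0 + (i + 0.5) * h  # grid is symmetric about x = 0
        total += h * math.exp(x) * x / (x * x + delta * delta)
    return total


def principal_value_reference(terms: int = 20) -> float:
    """PV integral of exp(x)/x over [-1, 1]: only sinh(x)/x survives, giving 2 * sum 1/(k * k!) over odd k."""
    return 2.0 * sum(1.0 / (k * math.factorial(k)) for k in range(1, 2 * terms, 2))


def one_sided_integral(delta: float) -> float:
    """Closed form of the integral over x in [0, 1] of x / (x**2 + delta**2)."""
    return 0.5 * math.log((1.0 + delta * delta) / (delta * delta))


print(pv_kernel_integral(1e-3), principal_value_reference())
print(one_sided_integral(1e-2), one_sided_integral(1e-3))
```

The symmetric integral approaches its limit with an error of order delta, whereas the one-sided integral has no limit as delta goes to zero.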

To illustrate the difference between these two prescriptions, it is instructive to see how our
calculation works in each. Since we already have an expression for the quark self-energy with
the principal value prescription, we begin with this choice. Defining

$$ I_0 = \lim_{\delta\to 0} \int_0^1 \frac{x\,dx}{x^2 + \delta^2} = \int_0^1 \frac{dx}{x} , \tag{1.136} $$



we can extract the ultraviolet divergent parts of Eq. (1.131)

e/2 



[M*)]w 



a s C F 2 (£_ 



4tt e \ 47r/i§ 



-e/2 



(4 J +41ogx-4) 



(1.137) 



As promised, a dependence on the log of $n\cdot q$ has been generated.

In the gluon sector, the situation is analogous. The fact that the axial gauge is 'physical' 
implies that the gluon self-energy must satisfy current conservation without any help from 
ghosts. The most general form for the gluon 1PI self-energy with our choice of kinematics is 



-n 2 q 



q ■ n 



q ■ n 



(1.138) 



The full gluon propagator, 

gab 



™ - i-TfT 



n 2 



l - Hi - n 2 



- i 



n ■ q 



-n»q Q 



93 I should mention here that we intend to use this prescription in the most naive way possible. The subtleties
of defining products and powers of this distribution are considered in [ jijj . 
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id al3 (q) 
x t-— x 



(1.139) 



n 2 



1 - iii - n 2 



- 1 



n ■ q 



-qpn 



where $d^{\mu\nu}(q) = -g^{\mu\nu} + (q^\mu n^\nu + q^\nu n^\mu)/q\cdot n$, is obtained by summing the 1PR contributions.



Inverting the matrix on either side, we see that the renormalized gluon field is defined via 

1 



1 - 



n 2 



- i 



n ■ q 



-n^q a 



AS 



(1.140) 



1-Hi 

At the one-loop level, only two diagrams contribute to $\Pi$. The quark loop is identical to
its covariant gauge counterpart and does not depend on our prescription for the light-cone 
singularity. The other diagram, the gluon loop, can be evaluated using the technique outlined 
above. The full result contains the UV divergence 



[n 2 (x)] uv 



a s CA 2 / fj, e 



,2„7E \ ~ e / 2 



4jt e \ 47t^q 
a s C A 2 / y?e< E 



11 irifTp 
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- 4Jo - 41ogx 
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(-4 + 4/ + 4 log x) 



(1.141) 



47r e \ 47t^q 

The three-point function that defines the renormalized coupling receives one-loop corrections
from the same diagrams we considered in covariant gauge. The integrals are a little more
complicated than in the self-energies, but can be evaluated readily if we are interested only in
the ultraviolet divergent part. Let us consider a longitudinally polarized gluon. These amplitudes
are obtained by contracting the gluon polarization index with $n^\mu$.^94 The quark correction
(Figure 1.7a) has the divergence



47T 



{-ig b t a jl)-±[C F --£)- 



C A \ 2 f^e^y e/2 
\ 4^ ) 



(4/ + 21ogXi + 21ogx / - 3) 



(1.142) 



while the gluon correction (Figure 1.7b) contains 

Xi +x f 



(-ig b t a ft) 



a s C A 2 fn 2 e^\ e/2 



8tt e \ 47r/iQ 



1 



x-i Xi ~r x f x f 
log 1 log — 



(1.143) 



Here, $x_i$, $x_f$, and $x_g$ are the light-cone momenta $n\cdot q$ for the initial quark, final quark, and
gluon, respectively.

94 For example, the leading three-point function for polarization $\alpha$ is $-ig t^a \gamma^\alpha$. For longitudinal polarization,
we write $-ig t^a \slashed{n}$.
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The self-energy corrections are performed by tacking the full propagators ( 1.12 2| ) and 



(1.139) on to the leading order diagram, and removing the middle propagator and the ex- 
ternal Z^-matrix to renormalize the external states. The quark, 



(-ig b t a fi) 
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Anfi 2 



4tt 



e/2 
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and gluon, 



(-ig b t a fi) 
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(1.144) 



(1.145) 



combine with the vertex corrections to give the full one-loop vertex function, 

2 / M V^~ e/2 
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(1.146) 



]{H 2 ) = g b 



Xf Xi X^Xj 

If we simply ignore the dependence on $x_i$, $x_f$, and $x_g$ for the moment, this expression implies
that the effective coupling at scale $\mu$ is given by^95

47r e \ An fil J 

which gives the $\beta$-function found in the last section.

The fact that the logarithms do not cancel implies that this gauge choice with the non-causal
prescription for the light-cone singularity is truly not renormalizable, as predicted.^96



11 2n f T F 



(1.147) 



However, the specific form (1.135) of the prescription is not actually used until the integrals are 

95 Note the difference between renormalized and unrenormalized perturbation theory. In renormalized
perturbation theory, we would have determined $Z_g$ from a subtraction of ultraviolet divergences. Here, we evaluate
the full contribution without a thought given to renormalization (except for the external states). The effective
coupling is determined by the requirement that our matrix element be finite when expressed in terms of it.

96 In the literature, one can find a calculation of the $\beta$-function in this gauge which does not contain these
difficulties. Expressions for the momentum-dependent 'Z-factors' from various diagrams are given, with
the result that the unsightly terms cancel in the renormalized coupling. Their calculation makes no mention
of the gluon polarization, and the wavefunction renormalization factors they employ are appropriate for the
product $\bar\psi\, \slashed{n} A_\perp \psi$. Since this combination does not appear in the lagrangian, it is unclear how to interpret
their result. In addition, several papers present calculations of the QCD splitting functions using PV. This
prescription closely resembles that used for eikonal propagation introduced in Section 2.3, so it may indeed
be appropriate for this purpose. However, if a renormalization procedure that produces sensible results in all
cases exists for this prescription, it is certainly not straightforward.
performed. If we choose instead to allow the light-cone denominators to act only algebraically,
and simply ignore any terms of the form $c/k\cdot n$, with $c$ independent of $k$, the embarrassing
terms never appear. Unfortunately, since the integrals in each diagram contain different limits,
this 'prescription' is hard to quantify mathematically. In addition, it is entirely unclear how
this procedure would be implemented at higher orders.

The situation for transversely polarized gluons is analogous. Here, even the Dirac structure 
is nontrivial. The quark 
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(1.149) 

vertex diagrams combine with the wavefunction renormalization factors to generate the full 
vertex 
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-C F ( (J - — logx/) ^ pi + (Jo + ^ logxO i 

Xg Xi 



Once again, we see that the correct one-loop $\beta$-function is obtained if the offensive contributions
are simply ignored.

Before moving on to the Mandelstam-Leibbrandt prescription, it is instructive for us to
study the way in which the offensive terms pollute our final results. In the longitudinal
direction, (1.146) shows that the unsightly logarithms confine themselves to the $C_A$-part of
the coupling. Tacking external states^97 onto (1.150) causes all but the $C_F$-part of these terms
to vanish. This fact is intimately related to the gauge choice we have made and its implications
on the physical states of our theory.

97 This procedure is somewhat delicate since truly onshell external states will be annihilated by fi ji whether 
or not it is accompanied by Xi. The ambiguity comes directly from our choice of collinear kinematics. To 
resolve it, we could choose to add a small transverse momentum component to each of our external legs; since 
this pole prescription is flawed in any case, we will not bother. 






Quantizing our theory at equal values of $n\cdot x$^98 rather than at equal times, we see
that the QCD equations of motion^99 in the gauge $n\cdot A = 0$ reveal a severe pathology with our
formalism: they do not dictate the $x^+$ ('time') evolution of either $A^-$ or $\psi_- = \frac{1}{2}\slashed{n}\slashed{p}\,\psi$. This
means that these field operators do not propagate freely in this gauge. Only the transverse
gluon field components and the physical 'good' quark field $\psi_+ = \frac{1}{2}\slashed{p}\slashed{n}\,\psi$ can be directly
interpreted in the perturbative theory we have derived. The other components must be re-expressed
in terms of the freely-propagating ones through their equations of motion before they
can be considered in our formalism. In light of this observation, it is not surprising that the
longitudinal contribution, which is associated with $A^-$, is polluted by $C_A$-type gluon effects and
the transverse contribution, which contains $\psi_-$, is polluted by $C_F$-type quark effects. In some
sense, the fact that these field components do not propagate freely makes our renormalization
of the external states insufficient to remove all associated divergences. Notice that the residual
divergence structures depend on all three momenta, so the renormalization of these 'bad' field
components actually depends on the diagram they are embedded in, further evidence of their
composite nature.

Now that we have seen the consequences of a noncausal prescription, let us see what happens
when we use one that is physically motivated. The most important practical difference between
the Mandelstam-Leibbrandt prescription (1.134) and the PV prescription is that ML introduces
another vector to the problem. This effectively adds a new scale and invalidates arguments
such as that in Footnote 90. However, this added complication is more than compensated by
the fact that ML preserves power counting.^100 Integrals of the form

$$ \int \frac{d^dk}{(2\pi)^d}\, \frac{1}{n\cdot k\, (k-q)^2} \tag{1.152} $$



contribute to ultraviolet divergences, but those of the form (1.133) do not. This leads to a 

98 See Appendix D.1.
99 See Appendix D.2.

100 I should mention here that the power-counting techniques of covariant calculations must be slightly modified
to obtain the correct results. Each of our $d$ dimensions can conceivably generate an ultraviolet divergence, so
we must power count each dimension separately. The integral

$$ \int \frac{d^dk}{(2\pi)^d}\, \frac{1}{(q-k)^2}\, \frac{1}{(n\cdot k)\,(n\cdot(k-l))\,(n\cdot(q-k))} \tag{1.151} $$

is divergent in spite of the fact that its overall degree of divergence is negative.
tremendous simplification. In particular, the integrals which generate the offensive contributions
in the PV prescription are automatically thrown away by ML.

The calculation is straightforward. One simplifies the expression of each diagram, taking
care to retain only those terms which contribute to the UV divergence, and performs the $k^+$
integration by contour. The transverse integrations are now trivial, and one is left with a
simple one-dimensional integral for $k^-$. With the same conventions as above, the self-energy
diagrams contribute
which is certainly quite a bit cleaner than the PV results! It is interesting to note that the
two prescriptions agree if we ignore the logarithm-type terms. This behavior is also seen in
the three-point diagrams, Figs. 1.7(a) and (b) in the longitudinal,
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(1.156) 

(1.157) 
(1.158) 



sectors, respectively. From these results, we easily arrive at the correct $\beta$-function, (1.102),
pollutant-free.

In order to understand why ML was able to circumvent the problems associated with renormalizing
the external composite 'bad' fields and generate the correct $\beta$-function, we appeal to
some of the arguments in the beginning of this section. Since the violence to Lorentz symmetry
(which leads directly to the fact that different components of the fields behave in different
ways) occurs only in the choice of gauge, it is unphysical. Its effects will cancel in the calculation
of gauge-invariant quantities, giving the same result for all well-defined gauge choices.
As we have seen, the contribution to the three-point function from each diagram depends on
the gluon polarization. The PV prescription does not exhibit a cancelation of this dependence,
leading to a directionally-dependent 'renormalized coupling' in direct violation of the
gauge symmetry. In ML, the observed cancelation allows us to define a sensible, directionally-independent
renormalized coupling. This is direct evidence that the causal prescription does
not do further damage to our theory and invalidate our gauge principle. Here, the explicit
breaking of Lorentz symmetry is indeed artificial and constrained to cancel in a sum over all
contributions.

The ML prescription contains one loose end that should be taken care of before we conclude
this section. When calculating the self-energy diagrams above, we implicitly assumed that they
have the same form in both prescriptions. In light of (1.123) and (1.140), this assumption still
leads to renormalization 'constants' which depend on the field momenta. Since we intend to
take ML seriously, we cannot be satisfied with this behavior. Looking at these expressions
again, we see that for the collinear kinematics we used, the replacement $q^\mu/n\cdot q \to p^\mu$ is
valid. As ML introduces $p^\mu$ independently of the external momenta, dependence on $p^\mu$ is
perfectly acceptable.^101 To test the validity of this replacement unambiguously, it is necessary
to consider transverse momenta. We do this explicitly in the quark sector.

Consider the complete 1PI quark self-energy in ML. Its most general form^102 is

$$ \Sigma(q) = A\,\slashed{q} + B\, p\cdot q\, \slashed{n} + C\, n\cdot q\, \slashed{p} + D\, m_b . \tag{1.159} $$

For completeness, we work with massive quarks. Iterating this result as above, one obtains
the full propagator

$$ \frac{i}{\slashed{q} - m_b - \Sigma} . \tag{1.160} $$

We would like to decompose this into a form reminiscent of (1.124), but with the added
requirement that the matrix $V$ be independent of $q$. Working through this, we arrive at the

101 Note that this is not the case for PV. There, dependence on the external momenta is unavoidable, even
in the absence of the logarithms.

102 Since QCD conserves parity and the self-energy is hermitian in the sense that $\Sigma^\dagger = \gamma^0 \Sigma \gamma^0$, terms involving
products of two or more $\gamma$-matrices either vanish or reduce to these structures.
full propagator

l-A-B 



{l-Af 



1 - 



B 



2(1 -A) 



where the renormalized mass is given by 



1 + D 



D 



2(1 -A) 



m = l _ A mi 



along with the consistency relation 



$$ C\,(1 - A - B) + B\,(1 - A) = 0 . $$



This implies that our renormalized fields are given by 



1> = 



I- A 



VI- A- B 
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D 



2(1 -A) 



(1.161) 



(1.162) 



(1.163) 



(1.164) 



If $A$, $B$, $C$, and $D$ are all independent of $q$ and satisfy (1.163), we are free to define proper
renormalized fields in the spirit of Section 1.4 and arrive at a sensible, consistent theory. At
one-loop order, we can calculate these quantities explicitly; with the common overall factor

$$ \frac{\alpha_s C_F}{4\pi}\, \frac{2}{\epsilon} \left( \frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2} \right)^{-\epsilon/2} $$



we have 



A = 

B = 

C = 

D = 



(1.165) 

(1.166) 
(1.167) 
(1.168) 
(1.169) 



Note that (1.163) is satisfied to the order at which we work and the correct gauge-invariant
dependence of $m(\mu)$ on $\mu$,

$$ \mu\, \frac{dm(\mu)}{d\mu} = -3\, \frac{\alpha_s C_F}{2\pi}\, m(\mu) , \tag{1.170} $$

is reproduced.
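
As an illustration of what this scale dependence implies, the sketch below integrates the mass equation together with a one-loop running coupling and compares the result with the closed-form power law $m(\mu) \propto \alpha_s(\mu)^{3C_F/\beta_0}$. The convention $\mu\, d\alpha_s/d\mu = -(\beta_0/2\pi)\,\alpha_s^2$ with $\beta_0 = 11 - 2n_f/3$ is an assumption made for the example and may be normalized differently from the $\beta$-function of the text; the function names and the numerical inputs are likewise hypothetical.

```python
import math

C_F = 4.0 / 3.0  # quark colour factor for SU(3)


def beta0(n_f: int) -> float:
    """One-loop coefficient, ASSUMING mu d(alpha_s)/d(mu) = -(beta0 / (2 pi)) * alpha_s**2."""
    return 11.0 - 2.0 * n_f / 3.0


def run_coupling_and_mass(alpha0, m0, mu0, mu1, n_f=5, steps=2000):
    """Integrate the coupled one-loop equations in t = ln(mu) with classical RK4."""
    b0 = beta0(n_f)

    def rhs(alpha, m):
        d_alpha = -b0 / (2.0 * math.pi) * alpha * alpha
        d_m = -3.0 * C_F / (2.0 * math.pi) * alpha * m
        return d_alpha, d_m

    h = math.log(mu1 / mu0) / steps
    alpha, m = alpha0, m0
    for _ in range(steps):
        k1 = rhs(alpha, m)
        k2 = rhs(alpha + 0.5 * h * k1[0], m + 0.5 * h * k1[1])
        k3 = rhs(alpha + 0.5 * h * k2[0], m + 0.5 * h * k2[1])
        k4 = rhs(alpha + h * k3[0], m + h * k3[1])
        alpha += h * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        m += h * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
    return alpha, m


def closed_form_mass(alpha0, m0, alpha1, n_f=5):
    """Solution of dm/d(alpha) = (3 C_F / beta0) * m / alpha."""
    return m0 * (alpha1 / alpha0) ** (3.0 * C_F / beta0(n_f))


# Hypothetical inputs: alpha_s = 0.2 and m = 4.5 at mu = 5, evolved up to mu = 100.
alpha1, m1 = run_coupling_and_mass(0.2, 4.5, 5.0, 100.0)
print(alpha1, m1, closed_form_mass(0.2, 4.5, alpha1))
```

Because the coupling decreases with increasing scale, the running mass decreases as well; for five flavours the exponent in this convention is 12/23.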

In the collinear limit, $n^\mu$, $p^\mu$, and $q^\mu$ are no longer linearly independent. Here, we can
calculate only the combined quantities $A_0 = A + C$ and $B_0 = B - C$. Fortunately, our
consistency relation (1.163) allows us to extract $A$, $B$, and $C$ from these two:



A = (2A + B ) + A Q (A + B ) , (1.171) 

B = \- A r B ° B Q (1.172) 

C = - 2-A~-B B ° ' (1 ' 173) 



Using these expressions, we see that (1.123) is just a re-expression of (1.164) in terms of $A_0$
and $B_0$. In this way, we can calculate the correct renormalization constants using collinear
kinematics.
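
The origin of the combinations $A_0$ and $B_0$, and their lowest-order inversion, can be sketched as follows. This is a reconstruction under stated assumptions, not the author's own working: it assumes $p^2 = n^2 = 0$ and $p\cdot n = 1$, a self-energy of the form $\Sigma = A\,\slashed{q} + B\,p\cdot q\,\slashed{n} + C\,n\cdot q\,\slashed{p} + D\,m_b$, and a consistency relation whose right-hand side vanishes. Only the leading order in $\alpha_s$ is kept in the last two lines.

```latex
\begin{align*}
\slashed{q} &= (n\cdot q)\,\slashed{p} + (p\cdot q)\,\slashed{n}
  \qquad \text{(collinear limit)} , \\
\Sigma(q) &= A\,\slashed{q} + B\,(p\cdot q)\,\slashed{n}
  + C\,\bigl[\slashed{q} - (p\cdot q)\,\slashed{n}\bigr] + D\,m_b
  = (A + C)\,\slashed{q} + (B - C)\,(p\cdot q)\,\slashed{n} + D\,m_b , \\
C\,(1 - A - B) + B\,(1 - A) &= 0
  \;\Longrightarrow\; C = -B + O(\alpha_s^2) , \\
B &= \tfrac{1}{2} B_0 + O(\alpha_s^2) , \qquad
C = -\tfrac{1}{2} B_0 + O(\alpha_s^2) , \qquad
A = A_0 + \tfrac{1}{2} B_0 + O(\alpha_s^2) .
\end{align*}
```

Beyond leading order the consistency relation is nonlinear, so the exact inversion is more involved than these one-loop expressions suggest.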

In the gluon sector of the theory, things are similar. Here, since there is no mass, we have
no need to calculate explicitly the analogues of $A$, $B$, and $C$. The collinear expressions and
$\Pi_2$ presented above are quite sufficient for all of our calculations.

It is interesting to note that the entire $\beta$-function is contained in the factor $\Pi_1$ of the
gluon self-energy. In our calculation of the three-point function, all of the other contributions 
conspire to cancel each other. This is a general property of the axial gauges at one-loop order, 
and provides an explicit example of how changes in gauge can shift contributions from graph 
to graph without changing the final result. 

In this section, we explored a non-covariant gauge choice. We have seen how subtle differ- 
ences in prescription can make or break renormalizability in a gauge which explicitly violates 
Lorentz symmetry. If we had broken the spacetime symmetry more explicitly, summing all 
contributions would not have been enough to restore its former health. This is what happens 
when we regularize divergences by a momentum cut-off, for example. There, special coun- 
terterms must be added to the lagrangian to forcibly restore the symmetry order by order in 
perturbation theory. 

The axial gauge's problems are not insurmountable. As we have shown, the theory possesses 
well-defined matrix elements which produce the correct behavior if implemented properly. 
Though some of its aspects are more involved than those of its covariant brother, this gauge is 
not without its charms. Among other things, it restores a certain degree of physical intuition to 
QCD. This gauge also represents tremendous simplifications in some applications, most notably 
those involving extremely high energy processes. For these reasons, it is often employed despite 
its difficulties. 
Chapter 2



Deep-Inelastic Scattering 



In the last chapter, we saw that asymptotic freedom assures us QCD perturbation theory 
is applicable to processes whose natural scales are large compared to the typical energy at 
which the coupling gets large. Unfortunately, confinement implies that all physical processes 
are inextricably linked to the low energy phenomena that lead to bound states. In order to 
calculate even the simplest of observables, we must first understand the complicated QCD 
bound states that appear in our experiments. Since perturbation theory cannot be applied 
to these states, the technology developed in the last chapter is useless for this purpose. In 
the absence of nonperturbative techniques for solving interacting quantum field theories, we 
must throw our hands in the air and give up the task of calculating any strong interaction 
cross-section from scratch. 

Although we cannot calculate truly physical quantities completely in QCD, there is hope 
that we can use experimental results to learn about these nonperturbative quantities. First 
advanced by Feynman in 1969 p7| , this idea is based on a separation of scales in certain pro- 
cesses. If the high- and low-energy contributions to a process can systematically be separated, 
one can calculate the high-energy physics in perturbation theory and use experimental data 
to extract the nonperturbative physics. These nonperturbative quantities can then be used 
not only to predict the outcome of other similar experiments, but also to teach us about the 
structure of QCD's bound states. 

The purpose of this chapter is to describe the scale-separation process in the context of 
inclusive deep-inelastic scattering (DIS). The formalism developed here can be applied to a 
number of physical processes, and represents the main way QCD is used to predict the outcome 
of experiments and give us insight into hadronic structure.

The process of DIS, along with the relevant structure functions and kinematics, is described
in Section 2.1. It is here that we will discuss what is measured in the experiment and how the
measurements are related to hadronic physics.



In Section 2.2, a qualitative argument is used to motivate the separation of scales. We use 
this argument to perform perturbative calculations of the hard scattering amplitude and find 
that it contains infrared divergences. These divergences signal soft nonperturbative physics 
that cannot be calculated reliably within perturbation theory. 



Section 2.3 explains that these divergences represent contributions from nonperturbative 
distribution functions. We carefully define these distributions and show that our amplitudes 
are perfectly finite when expressed in terms of them. A measurement of the cross-section, 
coupled with knowledge of the perturbative coefficients, will now allow us to extract these 
nonperturbative quantities. Some of the structural information contained in these distributions 
is discussed, as well as certain consistency relations they are required to satisfy. 



The analysis of Section 2.4 is devoted to infrared divergences and their relation to parton
distribution functions. It allows us to show that the soft, nonperturbative contributions to
inclusive DIS can be attributed exclusively to the parton distribution functions of Section 2.3
to all orders of QCD perturbation theory. Such a statement is called a factorization theorem.
The tools we will use to derive this result were originally developed by G. Sterman p7j, S. 
Libby Q, and J. Collins Q using the results of L. Landau Q. 

Since the perturbative physics in our process is related only to the microscopic degrees of
freedom of QCD, quarks and gluons, all target dependence is isolated in the nonperturbative
distribution functions. Section 2.5 shows how to use this fact to arrive at an operator relation
between the nonlocal product of two electromagnetic currents and an infinite series of local
partonic operators. Such local operator product expansions (OPE's) were first considered by
Wilson in 1969. Since then, they have been used extensively to study the analytic structure
of amplitudes and characterize scattering mechanisms. Using the results of Section 2.3, we
derive the next-to-leading order coefficients of the OPE and discuss the interpretation of the
local operators.



Section 2.6 contains a summary and some concluding remarks about inclusive DIS. 
2.1 Kinematics



Deep-inelastic scattering (DIS) is a process in which a lepton is scattered off of a hadron at very
high energy. The final state of this process will be a backscattered lepton and a complicated
hadronic state which we denote as $|X\rangle$. We consider an electron scattering via the exchange
of a highly virtual photon off of a proton, but the theoretical tools developed here also apply
to weak exchanges and processes involving other hadronic states. At leading order in the
electromagnetic coupling, the amplitude for this process is given by

$$ \langle k's'X |\, T \left\{ (-i) \int d^4x\, J_e^\mu(x) A_\mu(x)\, (-i) \int d^4y\, J_q^\nu(y) A_\nu(y) \right\} | ks, PS \rangle , \tag{2.1} $$



where I have denoted the electron's initial (final) momentum and spin as $k$ ($k'$) and $s$ ($s'$),
respectively, and the proton's momentum and spin by $P$ and $S$, respectively. The currents
$J_e^\mu = e\,\bar\psi_e \gamma^\mu \psi_e$ and $J_q^\mu = \sum_f e_f\, \bar\psi_f \gamma^\mu \psi_f$ are the usual electron and quark electromagnetic
currents, respectively. Recognizing that there are no photons in either external state^1 and
that the two currents are fundamentally different allows us to separate the matrix elements
and arrive finally at the amplitude^2

(k's' \ J?(0)\ks) (X \ J»(Q)\PS) g^(2ir) 4 6(px + k' - P - k) , (2.2) 



[k 1 - k) 2 

where $p_X$ represents the 4-momentum of the state $|X\rangle$ and I have employed Feynman's gauge
($\xi = 1$).

This is an extremely complicated object which is different for each possible final hadronic 
state. General processes such as this one are difficult to pin down theoretically. However, a 
great deal of information is contained in the inclusive cross-section, 

o cx -1 (fcV I J" (0)1 ks) (ks |J£(0)| k's') 9flu g a p 

xJ2(PS\^(0)\X)(X\j^0)\PS)(27r)^^(P + q- Px ) , (2.3) 
x 

for DIS. Here, I have defined $q = k - k'$. Such total cross-sections are much cleaner theoretically
and make the relationship between microscopic and macroscopic degrees of freedom more 
obvious. 



1 At leading order in the electromagnetic coupling.
2 I have used the operator identity $O(x) = e^{iP\cdot x}\, O(0)\, e^{-iP\cdot x}$, where $P$ is the momentum operator introduced
in Appendix B.2.
Since the physics outside the brackets is quite well understood (it can be done within the
framework of QED), we ignore it for the time being and focus on the hadronic object

$$ \begin{aligned}
W^{\mu\nu} &= \frac{1}{4\pi} \sum_X \langle PS | J^\mu(0) | X \rangle \langle X | J^\nu(0) | PS \rangle\, (2\pi)^4 \delta^{(4)}(P + q - p_X) \\
&= \frac{1}{4\pi} \int d^4z\, e^{iq\cdot z}\, \langle PS | J^\mu(z)\, J^\nu(0) | PS \rangle .
\end{aligned} \tag{2.4} $$

In deriving the final form I have used the fact that $\sum_X |X\rangle\langle X| = 1$ for any complete set of
orthonormal states. Since $W^{\mu\nu}$ is concerned only with quark electromagnetic currents, I have
dropped the subscript $q$. The factor $1/4\pi$ is a convention for future convenience.

Let us explore some of the properties of $W^{\mu\nu}$. Hermiticity of the currents implies that $(W^{\mu\nu})^* = W^{\nu\mu}$. $W^{\mu\nu}$ also conserves parity since all of our interactions and currents do. Furthermore, electromagnetic current conservation implies $q_\mu W^{\mu\nu} = q_\nu W^{\mu\nu} = 0$. These properties allow us to deduce the tensor structure^3

$$W^{\mu\nu} = \left(\frac{q^\mu q^\nu}{q^2} - g^{\mu\nu}\right) F_1(\nu, q^2) + \left(P^\mu - \frac{\nu}{q^2}\,q^\mu\right)\left(P^\nu - \frac{\nu}{q^2}\,q^\nu\right)\frac{1}{\nu}\,F_2(\nu, q^2)$$
$$\qquad\quad - i\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha S_\beta\,\frac{2M}{\nu}\,G_1(\nu, q^2) - i\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha\,(\nu S - q\cdot S\,P)_\beta\,\frac{2M}{\nu^2}\,G_2(\nu, q^2) \qquad (2.5)$$

for $W$, where $\nu = q\cdot P$ and $M$ is the proton mass. The totally antisymmetric Levi-Civita tensor, $\epsilon^{\mu\nu\alpha\beta}$, is introduced with the convention $\epsilon^{0123} = +1$ in Appendix B.1. The real scalar functions $F_{1,2}$ and $G_{1,2}$ are called structure functions of the proton. Our task is to relate these functions to the nonperturbative wavefunction of the proton. Our three external vectors define six invariants. Since $P^2 = M^2$, $S^2 = -1/4$, and $S\cdot P = 0$, we are left with two scalars and one pseudoscalar. The pseudoscalar $q\cdot S$ can contribute only linearly because its appearance is dictated by the Dirac structure of the matrix element. In four dimensions, one can reduce any Dirac structure to terms independent of or linear in $S^\mu$. Parity invariance forbids the latter from appearing anywhere except in the tensor structures multiplying $G_{1,2}$. This leaves us with the two invariants $q^2$ and $\nu$.

3 The factors of 2 adorning $G_1$ and $G_2$ compensate for the implied factor of 1/2 in the spin vector $S^\mu$. Note that this decomposition does not coincide with that found most often in the literature. These structure functions are proportional to those found in more standard treatments.
Our analysis up to this point has been quite general. The decomposition (2.5) is valid for all^4 values of $q^2$ and $\nu$. However, in order to make contact with the fundamental degrees of freedom of perturbative QCD, we must specialize to a specific region. In the last chapter, we saw that QCD becomes nonperturbative at low energies. This means that the familiar quarks and gluons of the QCD lagrangian are no longer the relevant excitations. At these low energy scales, we must consider collective excitations of quarks and gluons ('quasi-partons') rather than the degrees of freedom introduced in the last chapter. This behavior is reminiscent of solid state physics, where strong interactions between electrons and the lattice require us to consider 'quasi-electron' excitations rather than the familiar electron-ion excitations we see in 'free' electrodynamics. After some reasonable assumptions, the lagrangian for these effective degrees of freedom can be derived directly from the fundamental lagrangian. It can easily be seen that the quasi-electrons interact weakly and are therefore the relevant degrees of freedom.^5 Presumably, something similar is happening in our case. However, since we are dealing with an ultra-relativistic quantum field theory rather than non-relativistic quantum mechanics, we can no longer smoothly make the transition from fundamental to effective degrees of freedom.^6 This leaves us with two options. We can either write down a theory which is not^7 motivated by the fundamental theory, or stick to regions in which the effective degrees of freedom are the quarks and gluons of QCD. There are many models and theories which opt for the former, among them the nuclear shell model, Yukawa's pion theory, the constituent quark model, and chiral perturbation theory. Here, we attempt to learn something about QCD itself by taking the latter path.

4 Strictly speaking, we must choose physical values for $q^2$ and $\nu$. We will see below that this restricts $q^2 < 0$ and $\nu > 0$. However, our structure functions can be analytically continued to almost every region of the complex plane of $q^2$ and $\nu$.

5 Renormalization can also be viewed in this way: the fundamental pointlike fields in our lagrangian interact strongly with vacuum fluctuations. The more appropriate degrees of freedom are collective excitations of the fundamental fields and vacuum fluctuations, the renormalized fields.

6 This is merely another signal of our incompetence. Certainly, a transition can be made; we just don't know where to begin.

7 Directly.

To this end, we will consider DIS in the Bjorken limit, formally defined as $Q^2, \nu \to \infty$ while $x_B \equiv Q^2/2\nu$ remains constant. Here,^8

$$Q^2 \equiv -q^2 = 2k^0 (k')^0 (1 - \cos\theta) = 4k^0 (k')^0 \sin^2(\theta/2) > 0 , \qquad (2.6)$$

with $\theta$ the angle between the initial and final electron spatial momenta. This special kinematic regime admits quarks and gluons as appropriate degrees of freedom since they interact weakly at high energy. In this limit we can ignore the proton mass and all natural scales associated with QCD. A straightforward application of dimensional analysis^9 reveals that $W$, and therefore all of our structure functions, is dimensionless. In the Bjorken limit, the only relevant scales in our problem are $Q^2$ and $\nu$. Hence our structure functions should depend only on their ratio, $x_B$. While this is strictly true in the formal limit, for any finite $Q^2$ one will see violations of this scaling law coming from the dependence of the QCD coupling on scale. Since QCD is asymptotically free, these violations will get smaller and smaller as $Q^2$ is increased. This is one of the main reasons QCD was chosen as the underlying theory of the strong interactions, as detailed in the Introduction.
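As a concrete illustration of Eq. (2.6) and the definition of $x_B$, the following minimal sketch computes the invariants for a massless electron scattering off a proton at rest. The function name, the rounded proton mass, and the sample beam energies are assumptions made for this example only; the only physics used is Eq. (2.6) and the convention $\nu = q\cdot P$, which reduces to $M(k^0 - k'^0)$ in the proton rest frame.

```python
import math

PROTON_MASS = 0.938  # GeV; rounded value, assumed for illustration


def dis_invariants(E, E_prime, theta, M=PROTON_MASS):
    """Return (Q2, nu, x_B) for a massless electron on a proton at rest.

    E, E_prime : initial and final electron energies in GeV
    theta      : angle between initial and final electron momenta (radians)
    nu follows the convention nu = q.P, so it carries mass dimension 2.
    """
    Q2 = 4.0 * E * E_prime * math.sin(theta / 2.0) ** 2  # Eq. (2.6)
    nu = M * (E - E_prime)  # q.P evaluated in the proton rest frame
    x_B = Q2 / (2.0 * nu)
    return Q2, nu, x_B


if __name__ == "__main__":
    Q2, nu, x_B = dis_invariants(E=20.0, E_prime=12.0, theta=math.radians(10.0))
    print(f"Q^2 = {Q2:.3f} GeV^2, nu = {nu:.3f} GeV^2, x_B = {x_B:.3f}")
```

Both forms of Eq. (2.6) agree because $1 - \cos\theta = 2\sin^2(\theta/2)$; the sketch uses the second.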

8 Ignoring the electron mass.

9 Our states are normalized via $\langle p'|p\rangle = 2p^0\,(2\pi)^3\,\delta^{(3)}(\vec p\,' - \vec p\,)$, as in Appendix D.1, so each state contributes mass dimension $-1$.

The kinematics of the Bjorken limit allow us to write the vectors $q$ and $P$ in a very simple way. Since we have only two dynamical vectors, we can always choose a frame in which their spatial momenta are parallel to the 3-axis. In this frame, we can expand $q$ and $P$ in terms of the two basis vectors $p^\mu = \Lambda(1, 0, 0, 1)$ and $n^\mu = (1, 0, 0, -1)/2\Lambda$. $\Lambda$ is an arbitrary scale reflecting our freedom to boost along the 3-axis. Note that we have chosen basis vectors which satisfy $p^2 = n^2 = 0$ and $n\cdot p = 1$. We will call the component of a vector which is parallel to $p$ the '+' component and that which is parallel to $n$ the '$-$' component. This implies

$$k^0 = \Lambda k^+ + k^-/2\Lambda \; ; \qquad k^3 = \Lambda k^+ - k^-/2\Lambda , \qquad (2.7)$$

for any vector $k$. Our vectors have the expansion

$$P = p + \frac{M^2}{2}\,n , \qquad q = -\zeta\,p + \frac{Q^2}{2\zeta}\,n , \qquad (2.8)$$
where

$$\zeta \equiv \frac{\nu}{M^2}\left(-1 + \sqrt{1 + \frac{M^2 Q^2}{\nu^2}}\,\right) \to x_B \qquad (2.9)$$

as $Q^2 \to \infty$. Since the $-$ component of $P$ cannot form large scalars, we will ignore it whenever we can and simply write $P = p$.
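The light-cone expansion above is easy to check numerically. The sketch below is illustrative only: the helper names (`dot`, `light_cone_basis`, `zeta`, `proton_and_photon`) and the sample values are assumptions, not part of the text. It builds $p$ and $n$, forms $P$ and $q$ as in Eq. (2.8), and lets one confirm that $P^2 = M^2$, $q^2 = -Q^2$, $q\cdot P = \nu$, and that $\zeta$ approaches $x_B$ when $Q^2 \gg M^2$.

```python
import numpy as np

METRIC = np.diag([1.0, -1.0, -1.0, -1.0])  # mostly-minus Minkowski metric


def dot(a, b):
    """Minkowski product a.b for vectors with upper indices."""
    return float(a @ METRIC @ b)


def light_cone_basis(Lam=1.0):
    """Basis vectors p = Lam(1,0,0,1) and n = (1,0,0,-1)/(2 Lam)."""
    p = Lam * np.array([1.0, 0.0, 0.0, 1.0])
    n = np.array([1.0, 0.0, 0.0, -1.0]) / (2.0 * Lam)
    return p, n


def zeta(Q2, nu, M):
    """Eq. (2.9): the positive root of M^2 z^2 + 2 nu z - Q^2 = 0."""
    return (nu / M**2) * (-1.0 + np.sqrt(1.0 + M**2 * Q2 / nu**2))


def proton_and_photon(Q2, nu, M, Lam=1.0):
    """Eq. (2.8): P = p + (M^2/2) n and q = -zeta p + (Q^2 / 2 zeta) n."""
    p, n = light_cone_basis(Lam)
    z = zeta(Q2, nu, M)
    P = p + 0.5 * M**2 * n
    q = -z * p + Q2 / (2.0 * z) * n
    return P, q


if __name__ == "__main__":
    M, Q2, x_B = 0.938, 100.0, 0.3
    nu = Q2 / (2.0 * x_B)
    P, q = proton_and_photon(Q2, nu, M, Lam=2.5)
    print("P^2 =", dot(P, P), " q^2 =", dot(q, q), " q.P =", dot(q, P))
    print("zeta =", zeta(Q2, nu, M), " x_B =", x_B)
```

The value of `Lam` drops out of every invariant, which is the statement that $\Lambda$ only parameterizes a boost along the 3-axis.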

Due to the role played by the light cone in special relativity, truly massless fields are not associated with spin vectors. The analogous degree of freedom for these fields is the helicity, $h$.^10 As shown in Appendix B.2, the replacement $MS \to hP$ is valid as $M \to 0$. While this identification gets rid of the unsightly factors of $M$ multiplying the spin-dependent structure functions, the tensor structure associated with $G_2$ vanishes identically. In this term, we are certainly not free to take the proton mass to zero! As a consequence of this, we will see that the structure function $G_2$ is suppressed in the Bjorken limit.

We are now ready to discuss some of the contributions to DIS from various structure functions in the Bjorken limit. The lepton tensor we ignored above,

$$L^{\mu\nu} \equiv \langle k's'|\,J_e^\mu(0)\,|ks\rangle\langle ks|\,J_e^\nu(0)\,|k's'\rangle , \qquad (2.10)$$

can be calculated at leading order in QED. For massless electrons, only helicity eigenstates make sense. Since these do not interact with each other, the relation

$$u_\alpha(k, \lambda)\,\bar u_\beta(k, \lambda') = \frac{1}{2}\,\delta_{\lambda\lambda'}\left[(1 + 2\lambda\gamma_5)\,\slashed{k}\right]_{\alpha\beta} , \qquad (2.11)$$

where $\lambda, \lambda' = \pm 1/2$ label helicities, holds. Substitution gives

$$L^{\mu\nu} = 2e^2\left(k^\mu k'^\nu + k^\nu k'^\mu - g^{\mu\nu}\,k\cdot k'\right) - 4ie^2\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha\,(\lambda k)_\beta . \qquad (2.12)$$

As required by electromagnetic gauge invariance, $L$ conserves current if the electrons are both onshell. This makes the contraction $W^{\mu\nu}L_{\nu\mu}$ especially painless. We obtain



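$$W^{\mu\nu}L_{\nu\mu} = 2e^2\,Q^2\,F_1(\nu, Q^2) + 2e^2\left[2P\cdot k'\left(1 + \frac{P\cdot k'}{\nu}\right) - x_B M^2\right] F_2(\nu, Q^2)$$
$$\qquad\quad + 8e^2\lambda\,\frac{MS\cdot(k + k')}{\nu}\,Q^2\,G_1(\nu, Q^2) + 8e^2\lambda\left[\frac{MS\cdot(k + k')}{\nu} - \frac{M\,q\cdot S\;P\cdot(k + k')}{\nu^2}\right] Q^2\,G_2(\nu, Q^2) . \qquad (2.13)$$

10 I use the convention $h = \pm 1/2$ rather than $\pm 1$.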
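The spin-independent part of this contraction can be checked numerically. The sketch below is a minimal illustration that uses only the symmetric part of the lepton tensor, $2e^2(k^\mu k'^\nu + k^\nu k'^\mu - g^{\mu\nu}k\cdot k')$; the function names and sample momenta are assumptions for the example. It lets one confirm that this tensor is conserved, $q_\mu L^{\mu\nu} = 0$, and that contracting it with the tensor structure multiplying $F_1$ yields $2e^2Q^2$.

```python
import numpy as np

METRIC = np.diag([1.0, -1.0, -1.0, -1.0])  # g^{mu nu} = g_{mu nu}


def massless_momentum(E, theta, phi=0.0):
    """Four-momentum (upper indices) of a massless particle of energy E."""
    return E * np.array([
        1.0,
        np.sin(theta) * np.cos(phi),
        np.sin(theta) * np.sin(phi),
        np.cos(theta),
    ])


def lepton_tensor_sym(k, k_prime, e2=1.0):
    """Symmetric part of L^{mu nu} (upper indices) for massless electrons."""
    kk = float(k @ METRIC @ k_prime)
    return 2.0 * e2 * (np.outer(k, k_prime) + np.outer(k_prime, k) - METRIC * kk)


def f1_projection(k, k_prime, e2=1.0):
    """Contract (q^mu q^nu / q^2 - g^{mu nu}) with L_{mu nu}."""
    q = k - k_prime
    q2 = float(q @ METRIC @ q)
    projector = np.outer(q, q) / q2 - METRIC          # upper indices
    L_lower = METRIC @ lepton_tensor_sym(k, k_prime, e2) @ METRIC
    return float(np.sum(projector * L_lower))


if __name__ == "__main__":
    k = massless_momentum(10.0, 0.0)
    kp = massless_momentum(6.0, 0.4)
    q = k - kp
    Q2 = -float(q @ METRIC @ q)
    print("q_mu L^{mu nu} =", (METRIC @ q) @ lepton_tensor_sym(k, kp))
    print("F1 coefficient =", f1_projection(k, kp), " 2 e^2 Q^2 =", 2.0 * Q2)
```

The antisymmetric, helicity-dependent part is deliberately left out, since its normalization depends on the $\gamma_5$ and $\epsilon^{\mu\nu\alpha\beta}$ conventions of the appendices.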
If the initial and final electrons do not have transverse momentum components, the kinematics of the process require $k = q^- n$ and $k' = -q^+ p$. In this case, the quantity

$$2P\cdot k' = -M^2 q^+ = x_B M^2 + O(M^4/Q^2) \qquad (2.14)$$

causes the contribution of $F_2$ to be suppressed by $M^4/Q^4$ relative to that of $F_1$. Writing

$$MS^\mu = s^+ p^\mu + M^2 s^- n^\mu + MS_\perp^\mu \qquad (2.15)$$

on dimensional grounds, we see that $G_1$ contributes at the same order as $F_1$ and $G_2$ is suppressed by $(M^2/Q^2)$. Processes with appreciable transverse momentum see the much smaller suppression $(k_\perp^2/Q^2)$ of $F_2$ and allow the transverse components of $S^\mu$ to contribute as $(MS_\perp\cdot k_\perp/Q^2)$ to $G_{1,2}$, but do not alter the leading behavior. We will discuss these contributions in a later chapter. For now, however, we ignore the subleading structure functions and focus on $F_1$ and $G_1$.
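The collinear kinematics behind Eq. (2.14) can be verified with explicit four-vectors. This is a sketch under the stated assumptions only: electrons without transverse momentum, $k = q^- n$, $k' = -q^+ p$, with $q^+ = -\zeta$ and $q^- = Q^2/2\zeta$ read off from Eq. (2.8). The function name and numerical inputs are hypothetical.

```python
import numpy as np

METRIC = np.diag([1.0, -1.0, -1.0, -1.0])


def dot(a, b):
    """Minkowski product for vectors with upper indices."""
    return float(a @ METRIC @ b)


def collinear_kinematics(Q2, nu, M, Lam=1.0):
    """Return (P, k, k_prime, zeta) for electrons with no transverse momentum."""
    p = Lam * np.array([1.0, 0.0, 0.0, 1.0])
    n = np.array([1.0, 0.0, 0.0, -1.0]) / (2.0 * Lam)
    z = (nu / M**2) * (-1.0 + np.sqrt(1.0 + M**2 * Q2 / nu**2))  # Eq. (2.9)
    q_plus, q_minus = -z, Q2 / (2.0 * z)                          # Eq. (2.8)
    P = p + 0.5 * M**2 * n
    k = q_minus * n          # incoming electron
    k_prime = -q_plus * p    # outgoing electron
    return P, k, k_prime, z


if __name__ == "__main__":
    M, Q2, x_B = 0.938, 100.0, 0.3
    P, k, kp, z = collinear_kinematics(Q2, Q2 / (2.0 * x_B), M)
    print("2 P.k' =", 2.0 * dot(P, kp), "  x_B M^2 =", x_B * M**2)
```

The difference between $2P\cdot k' = \zeta M^2$ and $x_B M^2$ is the $O(M^4/Q^2)$ remainder quoted in Eq. (2.14).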

2.2 Deep-Inelastic Scattering on Partonic Targets 

The Bjorken limit was not introduced merely as a way to get rid of two structure functions 
(although this aspect is not without its charms). Rather, it was alleged to help us make contact 
with the fundamental degrees of freedom in QCD. To understand how this happens, let us take 
a moment to think about the physical process at hand. Our proton is a conglomerate of partons 
interacting strongly with each other. The process we are interested in concerns the scattering 
of an intruder off this complicated mess. 

Fortunately, some general arguments allow us to simplify our task. Causality implies that 
our partonic interactions do not happen instantaneously. Consider, for example, two partons 
under the influence of the nonperturbative force that binds them together. Since they feel this 
strong restoring force, it stands to reason that they are nearing the end of their leash. Their 
separation distance, d, should be of the same order as the nucleon size and is related to the 
energy scale, $\Lambda_{QCD}$, at which QCD becomes nonperturbative:

$$d \sim 1/\Lambda_{QCD} . \qquad (2.16)$$

Since they are separated by this distance, the interaction requires a time of order $1/\Lambda_{QCD}$ in which to occur. On the other hand, our virtual photon is living on borrowed time. The form of its propagator implies that it can only exist for a time of order $M/Q^2$ in the rest frame of the nucleon.^11 If $Q^2/M \gg \Lambda_{QCD}$, the partons simply do not have time to scatter nonperturbatively during the intruder's visit. Consequently, the intruder sees a frozen non-dynamic collection of partons rather than the complicated interacting soup we do not understand. By the time the partons can re-scatter, the process we are interested in is over and done with. These final-state interactions cannot affect the total cross-section for our process^12 and hence are unimportant to our treatment.
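To put rough numbers on this separation of time scales, the following sketch compares the photon lifetime $M/Q^2$ with the binding time $1/\Lambda_{QCD}$ in natural units. The function names and the rounded default values ($M \approx 0.938$ GeV, $\Lambda_{QCD} \approx 0.3$ GeV, $\hbar c \approx 0.1973$ GeV fm) are assumptions chosen for illustration.

```python
HBARC_GEV_FM = 0.1973  # hbar*c in GeV*fm (rounded), converts GeV^-1 to fm/c


def photon_lifetime(Q2, M=0.938):
    """Time available to the virtual photon in the nucleon rest frame, ~ M/Q^2 (GeV^-1)."""
    return M / Q2


def binding_time(lambda_qcd=0.3):
    """Time scale of nonperturbative parton interactions, ~ 1/Lambda_QCD (GeV^-1)."""
    return 1.0 / lambda_qcd


def incoherence_ratio(Q2, M=0.938, lambda_qcd=0.3):
    """(Q^2/M) / Lambda_QCD; the frozen-parton picture needs this to be large."""
    return binding_time(lambda_qcd) / photon_lifetime(Q2, M)


def to_fm_over_c(t_inverse_gev):
    """Convert a time in GeV^-1 to fm/c."""
    return t_inverse_gev * HBARC_GEV_FM


if __name__ == "__main__":
    for Q in (1.0, 3.0, 10.0):
        Q2 = Q * Q
        print(f"Q = {Q:4.1f} GeV: photon time = {to_fm_over_c(photon_lifetime(Q2)):.4f} fm/c, "
              f"binding time = {to_fm_over_c(binding_time()):.3f} fm/c, "
              f"ratio = {incoherence_ratio(Q2):.1f}")
```

The ratio grows like $Q^2$, which is why the incoherent picture improves quickly once $Q$ reaches a few GeV.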

Since the partons do not interact with each other during the scattering, we can treat each parton as though it were alone. In this way, the complete scattering tensor becomes a weighted average over 'partonic scattering tensors':

$$W^{\mu\nu} = \sum_a f_{a/T}\; w_a^{\mu\nu} . \qquad (2.18)$$

Here, $f_{a/T}$ is the probability of finding a parton of type $a$ in our target $T$ and $w_a^{\mu\nu}$ is the scattering tensor for a partonic 'target' of type $a$. The sum extends over all parton species, spin, and momentum. I emphasize here that this decomposition can be done only in cases where the scattering is incoherent, i.e. the partons do not interact with each other during the scattering.^13 Measurements of the running of $\alpha_s$ and the size of various hadrons reveal the strong interaction scale to be of order 300 MeV; as long as we have $Q$ of order 2-3 GeV or more, the scattering can reasonably be considered incoherent. Some of the approximations of the last section require somewhat larger $Q$ since they ignore the nucleon mass (roughly 1 GeV). Ideally, inclusive DIS is considered at energies of order 10 GeV or larger.

11 Its propagator is of the form

(2.17)

For large $q\cdot x$, small variations in $x$ cause wild oscillations in the integrand which effectively damp contributions. Hence dominant contributions come only from regions where both $q\cdot x$ and $x^2$ are small. Since $q^\mu$ is not light-like (far from it, in fact), this implies $x^\mu$ is small. How small is dictated by the components of $q^\mu$ in the frame we are using. For example, in the rest frame of the proton, the time allotted to our virtual photon is on the order of $1/q^0 \sim M/Q^2$.

12 They can, however, affect an exclusive cross-section like $\gamma^* p \to n\,2\pi^+\pi^-\pi^0$. This process could have begun as $\gamma^* p \to p\,\eta$, with re-scattering effects producing the observed final state long after the virtual photon scattering. This is one of the main simplifications provided by inclusive DIS.

13 We have shown that for $Q^2/M \gg \Lambda_{QCD}$ partons do not interact over long distances during the scattering. Short distance perturbative interactions certainly can take place during the scattering. However, the exchange of vast amounts of energy between partons within a nucleon is suppressed simply because there is not a lot of ambient energy in the medium. These kinds of parton correlation effects are suppressed by powers of $Q$. We will discuss a certain class of them in the last chapter.

Equation (2.18) is a mathematical expression of the physical separation of scales in our process. It states that the full amplitude can be factorized into a product of two parts. The functions $f_{a/T}$ describing the probability of finding various partons within our target, called parton distribution functions, are nonperturbative in nature and cannot be calculated using the techniques considered here. These objects are related to soft, low-energy physics and do not depend on the specific scattering process. On the other hand, the partonic scattering tensors $w_a^{\mu\nu}$ are expressed entirely in terms of the fundamental degrees of freedom in QCD and can in principle be calculated in perturbation theory. They are associated with the hard, high-energy physics of the scattering and do not depend on the target we consider. This separation between hard and soft physics, called factorization, is indispensable to modern QCD. Many recent results in perturbative QCD involve identifying and isolating the relevant distributions analogous to $f_{a/T}$ for a certain process and calculating their coefficients. The remainder of this section is devoted to a calculation of DIS on partonic targets. We shall have a lot more to say about parton distributions in the next section.

At present, perturbation theory is not readily applicable to the calculation of $W$ because it does not involve a time-ordered product. We can remedy this in the following roundabout way. Consider the tensor

$$\int d^4z\; e^{iq\cdot z}\,\langle PS|\,J^\nu(0)\,J^\mu(z)\,|PS\rangle . \qquad (2.19)$$

Introducing a complete set of intermediate states and shifting the argument of $J^\mu$ to the exponent, we see that it is equal to

$$\sum_X \langle PS|\,J^\nu(0)\,|X\rangle\langle X|\,J^\mu(0)\,|PS\rangle\,(2\pi)^4\,\delta^{(4)}(q + p_X - P) . \qquad (2.20)$$

The sum extends over all physical intermediate states with energy-momentum $p_X = P - q$.
Since we intend to work in the region^14 $(q^0)_{rf} \gg (P^0)_{rf} = M$, there are no physical states which satisfy the $\delta$-function. Hence we can write

$$W^{\mu\nu} \sim \frac{1}{4\pi}\int d^4z\; e^{iq\cdot z}\,\langle PS|\,[J^\mu(z), J^\nu(0)]\,|PS\rangle , \qquad (2.21)$$

14 The subscript $rf$ reads 'in the rest frame of the proton'. Since the zero component of a four vector is not a scalar, we must specify a frame to give it meaning.

where the '$\sim$' is meant to remind us that this identification is only valid in the physical region of scattering, i.e. $(q^0)_{rf} > 0$.^15 We still do not have a time-ordered product, but can use the same trick on the tensor

$$T^{\mu\nu} = i\int d^4z\; e^{iq\cdot z}\,\langle PS|\,\mathrm{T}\,J^\mu(z)\,J^\nu(0)\,|PS\rangle \qquad (2.22)$$

to obtain

$$T^{\mu\nu} \sim i\int d^4z\; \theta(z^0)\,e^{iq\cdot z}\,\langle PS|\,[J^\mu(z), J^\nu(0)]\,|PS\rangle . \qquad (2.23)$$

The step-function, $\theta(z^0)$, comes from the time-ordered product. The complex conjugate of $T^{\mu\nu}$,

$$(T^{\mu\nu})^* \sim -i\int d^4z\; \theta(-z^0)\,e^{iq\cdot z}\,\langle PS|\,[J^\nu(z), J^\mu(0)]\,|PS\rangle , \qquad (2.24)$$

differs from $T^{\nu\mu}$ only in the argument of the $\theta$-function. Writing

$$\theta(z^0) = \frac{1}{2}\left[1 + \mathrm{sign}(z^0)\right] , \qquad (2.25)$$

where $\mathrm{sign}(x)$ returns the sign of its argument, it becomes obvious that

$$\mathrm{Im}\,T^{(\mu\nu)} = 2\pi\,W^{(\mu\nu)} \qquad (2.26)$$

for the symmetric parts and

$$\mathrm{Re}\,T^{[\mu\nu]} = 2\pi i\,W^{[\mu\nu]} \qquad (2.27)$$
for the antisymmetric parts. Decomposing $T^{\mu\nu}$ as above,

$$T^{\mu\nu} = \left(\frac{q^\mu q^\nu}{q^2} - g^{\mu\nu}\right) T_1(\nu, q^2) + \left(P^\mu - \frac{\nu}{q^2}\,q^\mu\right)\left(P^\nu - \frac{\nu}{q^2}\,q^\nu\right)\frac{1}{\nu}\,T_2(\nu, q^2)$$
$$\qquad\quad - i\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha S_\beta\,\frac{2M}{\nu}\,S_1(\nu, q^2) - i\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha\,(\nu S - q\cdot S\,P)_\beta\,\frac{2M}{\nu^2}\,S_2(\nu, q^2) , \qquad (2.28)$$

we see that the structure functions associated with $W$ are just $1/2\pi$ times the imaginary part of those associated with $T$. This relation is only true in the physical region, but we are free to calculate our structure functions in any way we wish. Since $T$ involves a time-ordered product, it is of exactly the form we need to apply perturbation theory.

15 Since baryon number is conserved in all interactions we will consider, the state $|X\rangle$ must include at least one baryon. Hence its energy must be at least the proton mass.

Figure 2.1: The two leading-order diagrams, (a) and (b), for virtual photon-quark scattering. The crossing symmetry allows us to obtain one from the other.

Let us consider scattering on a massless target of momentum $xp$. The leading order amplitude involves a simple contraction of the quark currents with each other and the external states. The two relevant diagrams are shown in Figure 2.1. We need only consider one of them explicitly because the other can be obtained from crossing symmetry, $q \to -q$ and $\mu \leftrightarrow \nu$. This is an exact symmetry of $T$, independent of the QCD dynamics, as can be seen directly from its definition. Figure 2.1a is trivial to evaluate. It has the value^16

$$\frac{e_q^2}{2\nu}\,\frac{1}{x_B - x - i\epsilon}\;\bar u(h, xp)\,\gamma^\mu\,(x\slashed{p} + \slashed{q})\,\gamma^\nu\,u(h, xp) , \qquad (2.29)$$

where $u(h, xp)$ and $\bar u(h, xp)$ are the Dirac wavefunctions of a free quark of momentum $xp$ and helicity $h$ introduced in Appendix B.3, and $e_q$ is the charge of a quark of flavor $q$.^17 As above, the Dirac product can be simplified by writing

$$u_\alpha(h, xp)\,\bar u_\beta(h, xp) = \frac{1}{2}\left[(1 + 2h\gamma_5)\,x\slashed{p}\right]_{\alpha\beta} , \qquad (2.30)$$

which is valid for massless spinors. In this way, the amplitude reduces to the simple Dirac trace

$$\frac{e_q^2}{4\nu}\,\frac{1}{x_B - x - i\epsilon}\;\mathrm{Tr}\left[\gamma^\mu\,(x\slashed{p} + \slashed{q})\,\gamma^\nu\,(1 + 2h\gamma_5)\,x\slashed{p}\right] . \qquad (2.31)$$

16 We have kept our definition of $\nu = p\cdot q$ rather than changing to $xp\cdot q$ since in the end we will be concerned with the tensor structures relevant to the true target of momentum $P$ rather than this fictitious quark target.

17 Here, I resort to the standard notation $q$ for quark flavor rather than $f$.

The '1' contributes only to the symmetric part of $T^{\mu\nu}$, while $\gamma_5$ contributes only to the antisymmetric part. Concentrating on the symmetric part for the moment, we find

$$T_q^{(\mu\nu)} = x e_q^2\left(\frac{1}{x_B - x - i\epsilon} - \frac{1}{x_B + x - i\epsilon}\right)\left(\frac{q^\mu q^\nu}{q^2} - g^{\mu\nu}\right)$$
$$\qquad\quad + \frac{2x^2 e_q^2}{\nu}\left(\frac{1}{x_B - x - i\epsilon} + \frac{1}{x_B + x - i\epsilon}\right)\left(p^\mu - \frac{\nu}{q^2}\,q^\mu\right)\left(p^\nu - \frac{\nu}{q^2}\,q^\nu\right) , \qquad (2.32)$$

where the crossing term, $q \to -q$ and $\mu \leftrightarrow \nu$, has been taken into account.^18 The structure functions relevant to our experiment are now easy to compute:^19

$${}^{q}F_1(x, \nu, Q^2) = x e_q^2\,\frac{1}{2}\,\delta(x_B - x)$$
$${}^{q}F_2(x, \nu, Q^2) = x^2 e_q^2\,\delta(x_B - x) = 2x_B\;{}^{q}F_1(x, x_B) . \qquad (2.33)$$

Here, we see explicitly that the structure functions scale; they depend only on the ratio $x_B$ of the two large scales $\nu$ and $Q^2$.^20 Along with the Callan-Gross relation, ${}^{q}F_2 = 2x_B\,{}^{q}F_1$, scaling is a general property of this kind of scattering process. This behavior can be shown quite generally via the current algebra techniques of Gell-Mann |g5|. The fact that it is broken by quantum effects was one of the main motivations of D. Gross' search for a proof that quantum field theory cannot describe experiment. His subsequent discovery of asymptotic freedom turned his argument on its head and led to the realization that only a theory like QCD can be consistent with experiment.
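The claim that the '1' feeds only the symmetric part of the trace (2.31) and $\gamma_5$ only the antisymmetric part can be confirmed with explicit matrices. The sketch below is an illustration, not the method of the text: it assumes the Dirac representation with $\gamma_5 = i\gamma^0\gamma^1\gamma^2\gamma^3$ (the appendices may use a different but equivalent convention), and the function names are hypothetical. It evaluates $\mathrm{Tr}[\gamma^\mu \slashed{a} \gamma^\nu (1 + 2h\gamma_5)\slashed{b}]$ for arbitrary vectors $a$ and $b$, which play the roles of $xp + q$ and $xp$.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
PAULI = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
# Dirac representation (an assumed convention): gamma^0, gamma^1, gamma^2, gamma^3
GAMMA = [np.block([[I2, Z2], [Z2, -I2]])] + [
    np.block([[Z2, s], [-s, Z2]]) for s in PAULI
]
GAMMA5 = 1j * GAMMA[0] @ GAMMA[1] @ GAMMA[2] @ GAMMA[3]
METRIC = np.diag([1.0, -1.0, -1.0, -1.0])


def slash(v):
    """v-slash = gamma^mu v_mu for a vector v given with upper indices."""
    return sum(METRIC[m, m] * v[m] * GAMMA[m] for m in range(4))


def dirac_trace(a, b, h):
    """T[mu, nu] = Tr[gamma^mu a-slash gamma^nu (1 + 2 h gamma5) b-slash]."""
    one = np.eye(4, dtype=complex)
    T = np.empty((4, 4), dtype=complex)
    for mu in range(4):
        for nu in range(4):
            T[mu, nu] = np.trace(
                GAMMA[mu] @ slash(a) @ GAMMA[nu] @ (one + 2.0 * h * GAMMA5) @ slash(b)
            )
    return T


if __name__ == "__main__":
    a = np.array([3.0, 0.4, -1.2, 2.0])
    b = np.array([1.0, 0.0, 0.0, 1.0])
    unpolarized = dirac_trace(a, b, 0.0)
    polarized = dirac_trace(a, b, 0.5) - unpolarized
    print("symmetric:", np.allclose(unpolarized, unpolarized.T))
    print("antisymmetric:", np.allclose(polarized, -polarized.T))
```

The helicity-independent piece reproduces the familiar $4(a^\mu b^\nu + a^\nu b^\mu - g^{\mu\nu}a\cdot b)$, which is the origin of the two tensor structures in Eq. (2.32).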

The antisymmetric part of $T$ is given by the $\gamma_5$ term in the trace. The result is

$$T_q^{[\mu\nu]} = -i\,\epsilon^{\mu\nu\alpha\beta}\,q_\alpha\,(hp)_\beta\,\frac{2x e_q^2}{\nu}\left(\frac{1}{x_B - x - i\epsilon} + \frac{1}{x_B + x - i\epsilon}\right) . \qquad (2.34)$$

This implies

$${}^{q}G_1(\nu, Q^2) = x e_q^2\,\frac{1}{2}\,\delta(x_B - x) = {}^{q}F_1(\nu, Q^2) . \qquad (2.35)$$

Once again, we see scaling.

18 Note that the crossing is not exactly $\mu \leftrightarrow \nu$ and $q \to -q$ since the $+i\epsilon$ must also change sign. This fact is related to the analytic properties of $T^{\mu\nu}$ as a function of $x_B$.

19 The $+i\epsilon$'s introduced in Section 1.5 play a pivotal role here.

20 The dependence of $T_q$ on the + component of the quark momentum is viewed as dependence on the type of parton.

The presence of $\delta(x_B - x)$ in our structure functions is easy to understand from the standpoint of 'physical' quark scattering. At leading order, the only intermediate state accessible to the quark-photon system is an excited quark. If this state is to be physical, the quark must be onshell. This implies $2x\nu = Q^2$, or $x = x_B$.
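The $\delta$-function arises from the imaginary part of the propagator denominator: as $\epsilon \to 0^+$, $\mathrm{Im}\,[1/(x_B - x - i\epsilon)] = \epsilon/[(x_B - x)^2 + \epsilon^2] \to \pi\,\delta(x_B - x)$. Combined with the factor $1/2\pi$ relating $W$ to $\mathrm{Im}\,T$, this produces the $\frac{1}{2}\delta(x_B - x)$ in the structure functions. A minimal numerical sketch of this limit follows; the smooth weight function, grid size, and function name are arbitrary choices made for illustration.

```python
import numpy as np


def smeared_imaginary_part(x_B, eps, weight, n_points=400001):
    """Integrate Im[1/(x_B - x - i eps)] * weight(x) over 0 <= x <= 1.

    Uses a plain trapezoid rule on a uniform grid; as eps -> 0+ the result
    should approach pi * weight(x_B) for x_B strictly inside (0, 1).
    """
    x = np.linspace(0.0, 1.0, n_points)
    y = np.imag(1.0 / (x_B - x - 1j * eps)) * weight(x)
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


if __name__ == "__main__":
    weight = lambda x: x * (1.0 - x)  # stand-in for a smooth parton density
    x_B = 0.3
    for eps in (1e-1, 1e-2, 1e-3):
        value = smeared_imaginary_part(x_B, eps, weight)
        print(f"eps = {eps:g}: integral = {value:.5f}, pi * weight(x_B) = {np.pi * weight(x_B):.5f}")
```

The grid spacing must stay well below `eps`, otherwise the narrow Lorentzian peak is not resolved and the sum no longer tracks the limit.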

Our amplitude requires corrections due to virtual gluon exchange, as shown in Figure 2.2. As presented, these diagrams are riddled with ultraviolet divergences. The discussion on renormalization in Section 1.4 tells us how to deal with these divergences. It is simplest to use the $\overline{\mathrm{MS}}$ scheme in Feynman's gauge ($\xi = 1$). In renormalized perturbation theory, there are three relevant counterterms. The obvious insertion is $i(Z_F - 1)(x\slashed{p} + \slashed{q})$ for the intermediate quark propagator. The other two come from the electromagnetic sector of the theory.

Consider for a moment the quark-photon interaction term in QED. Its structure is identical to the quark-gluon interaction with $t^a$ replaced by 1:

$$\mathcal{L}^{QED}_{int} = -Q_q\,e_b\,\bar\psi_b\,\slashed{A}_b\,\psi_b , \qquad (2.36)$$

where $Q_q$ is the quark charge, $e_b$ is the bare electromagnetic coupling^21 and $A_b$ is the bare photon field. Defining our $Z$-factors in the same way as for QCD, we see that this leads to the counterterm

$$-\left(Z_e Z_F Z_A^{1/2} - 1\right) Q_q\,e\,\bar\psi\,\slashed{A}\,\psi \qquad (2.37)$$

in the QED lagrangian. Electromagnetic gauge invariance requires the running of the coupling to be the same for all electromagnetically charged particles. Hence the QCD ultraviolet divergence structure of the quark electromagnetic current must be identical to the divergence structure of the quark self-energy. This must be satisfied diagram by diagram, i.e. for each QCD diagram the sum of all possible electromagnetic current insertions must have an ultraviolet divergence which is identical to (the negative of) that of the corresponding self-energy diagram. The cancelation, being a consequence of electromagnetic gauge invariance, is completely independent of the QCD dynamics.

21 The electromagnetic coupling is a parameter which summarizes the strength of the interaction of all charged fields with the photon. It is analogous to the QCD coupling $g$. The charge of a field describes the way a field transforms under the gauge group. It is analogous to the 3 of SU(3). For example, the charge of an up quark is 2/3 since under an electromagnetic gauge transformation parameterized by $\alpha$ its phase rotates by the amount $2\alpha/3$. In contrast, the (renormalized) electromagnetic coupling $e \approx \sqrt{4\pi/137} \approx 3/10$ is the same for all electromagnetically charged fields. When the difference between these two objects is inessential, it is often ignored by introducing the 'quark electromagnetic charge' $e_q = Q_q e$. However, one must always remember that they are not the same. In particular, the coupling $e$ is just a number while the charge $Q$ is a quantum mechanical operator describing the behavior of fields under a gauge transformation.

The full counterterm contribution to the amputated diagrams is taken into account simply by multiplying the leading order result by $Z_F$. While this is all that is necessary to cancel the ultraviolet divergences, it will not result in a properly normalized amplitude. Since we have not used the onshell renormalization scheme, the residue of our renormalized propagator is not $i$. While this fact is unimportant for intermediate quark propagation, our external quark wavefunctions must be correctly normalized.^22 This is done exactly as in the onshell scheme: we multiply the amputated Feynman diagrams by the full (1PR) propagators, (re)renormalize the external states by dividing by the square root of the propagator residue,^23 multiply by the inverse propagator, take the onshell limit, and attach external free wavefunctions. This procedure is known as the Lehmann-Symanzik-Zimmermann (LSZ) reduction formula |^T|. It is quite general and applies to all quantum field theories, regardless of whether or not they are renormalizable. Since the full propagators contain a power of the residue,^24 the end result is simply multiplying the amputated graphs by the square root of the residue for each physical external state.

To see this explicitly, let us examine the fully renormalized quark propagator in this scheme. The 1PI graphs sum to $-i\Sigma(p) = -iB(p^2)\,\slashed{p}$, with a counterterm contribution $i(Z_F - 1)\,\slashed{p}$. The full propagator becomes

$$\frac{i}{\left[Z_F - B(p^2)\right]\slashed{p}} . \qquad (2.38)$$

The residue of this object at $p^2 = 0$ is $[Z_F - B(0)]^{-1}$. Since $B(p^2)$ is dimensionless, it may be written as^25

$$B(p^2) = \sum_{n=1}^{\infty} b_n \left(\frac{p^2}{\mu_0^2}\right)^{-n\epsilon/2} . \qquad (2.39)$$

22 Since this calculation does not represent actual physical scattering, the difference between properly and improperly normalized quark fields can be absorbed into the distribution functions. However, if we wish to relate these functions to the effective high-energy degrees of freedom of QCD, we had better normalize them properly.

23 The word 'residue' is used in this context to mean 'mathematical residue divided by $i$', or 'thing that should be 1'.

24 Actually, since we intend to take the onshell limit after all of these manipulations, the full propagator is effectively just the pole contribution.



Since our renormalized amplitude is free of ultraviolet divergences, we are free to take $\epsilon < 0$.^26 In this case, it becomes obvious that $B(0) = 0$. This implies that the correct amplitude is obtained by multiplying the amputated result by $Z_F^{-1/2}$ for each external quark leg, or $1/Z_F$ in all. It must be emphasized that this factor is not ultraviolet divergent.^27 All of the ultraviolet divergences have been removed by renormalization. This factor contains the infrared divergence generated by taking the onshell limit. However, as long as we recognize that there are no ultraviolet divergences left in the problem, we may allow this $Z_F$ to cancel the one from the counterterms. This means that after all is said and done all of the $Z_F$ factors necessary for a properly normalized amplitude cancel. The true amplitude is simply the sum of the diagrams, without a thought given to counterterms or proper normalization. We must keep in mind, though, that all of the divergences we run into from now on are infrared and cannot simply be swept into renormalization constants.

In unrenormalized perturbation theory, the story is somewhat different. The procedure here is to calculate all diagrams, renormalize external states, and express the result in terms of renormalized quantities at the end of the day. The result of these manipulations is guaranteed to be an ultraviolet finite quantity. In addition, onshell external states require infrared (re)renormalization as discussed above. In our case, the electromagnetic currents do not require renormalization at this order in QED.^28 As mentioned above, all of their ultraviolet divergences cancel with intermediate self-energy diagrams. The external states, on the other hand, require an overall division by $Z_F$ to cancel their ultraviolet divergences. The amplitude we are left with is guaranteed to be free of ultraviolet divergences. Once again, we (re)renormalize the onshell external states through multiplication by an infrared divergent $Z_F$. The end result is that all of our $Z_F$ factors cancel; we need only calculate the amputated diagrams and attach onshell external wavefunctions.

25 As before, the $\mu_0$'s appear to keep the coupling dimensionless.

26 We cannot do this in the unrenormalized amplitude since $\epsilon$ is required positive to regulate the ultraviolet divergences. Only in an ultraviolet finite quantity can we take $\epsilon < 0$.

27 This is why we never had to take it into account before. If we are concerned only with the ultraviolet behavior of an operator, it is best not to take the onshell limit so this kind of 'mixing of divergences' does not occur.

28 At higher orders, the renormalization of these currents is reflected in the renormalized coupling.

Figure 2.2: Contributions to virtual photon-quark scattering at (a) leading and (b)-(e) next-to-leading order in the QCD coupling.

The final step in the LSZ program involves taking the onshell limit of our amplitudes. 
As we have seen, this leads to infrared divergences in general. Although they seem like a 
terrible defect at first, the appearance of these divergences is quite natural. Their origins and 
treatment are discussed in the next section; for now, we simply acknowledge their existence 
and proceed. 

There are several ways to do this calculation. One method is that used for the leading order amplitude. The unpolarized (symmetric) part of $T$ is calculated by averaging over external helicities and the polarized (antisymmetric) part is obtained by taking half the difference of right- and left-handed helicities. In this way, both parts become a trace which can be simplified before performing the loop integration. Unfortunately, subtleties associated with the definition of $\gamma_5$ in $d$ dimensions^29 make the polarized result ambiguous. Thus, in order to resolve this ambiguity, one must choose a consistent $\gamma_5$ scheme. The most natural thing to do is simply perform the loop integration over the full Dirac structure, leaving the introduction of $\gamma_5$ for later. This approach has the advantage of obtaining both the polarized and unpolarized amplitudes simultaneously. After the loop integration, the only vectors left over are external and may be taken to be in the first four dimensions. At this point, we are free to use the above method to extract the symmetric and antisymmetric parts explicitly.

29 See Appendix F.1.

The intermediate quark self-energy diagram, Figure 2.2b, can be done using the techniques of Appendix A.3. Ignoring terms of order $\epsilon$, the result is

$$\frac{\alpha_s C_F}{4\pi}\left(\frac{Q^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\left[\frac{2}{\epsilon} + 1 - \log\frac{x_B - x - i\epsilon}{x_B}\right]\frac{e_q^2}{2\nu}\,\frac{1}{x - x_B + i\epsilon}\;\gamma^\mu\,(x\slashed{p} + \slashed{q})\,\gamma^\nu + (\mu \leftrightarrow \nu,\; q \to -q) , \qquad (2.40)$$

where the external states have been omitted. Once again, we see the addition $(\mu \leftrightarrow \nu,\; q \to -q)$ from the photon crossing diagram. In the interest of space, this crossing contribution is implied in all of the following results unless indicated otherwise. As before, the $+i\epsilon$ plays a fundamental role in the extraction of the imaginary part of the amplitude. Note that we can ignore it here simply by endowing $x_B$ with a small negative imaginary part, $\tilde x_B = x_B - i\epsilon$. This behavior persists in all of our diagrams, provided that the crossing is obtained by $\tilde x_B \to -\tilde x_B$.^30 In the following, we drop the tilde on $x_B$ and simply remember that $x_B$ has a small negative imaginary part.

The calculation of the remaining diagrams is greatly simplified by taking $\mu$ and $\nu$ in the transverse directions. A look at the tensor structures relevant to $F_1$ and $G_1$ tells us that this simplification is free of charge - none of the leading dynamics is affected by it. Since the external states in our calculation satisfy the free equation of motion, $\slashed{p}\,u(xp, h) = 0$, the intermediate self-energy diagram may be rewritten for transverse $\mu$, $\nu$ as

$$\frac{\alpha_s C_F}{4\pi}\left(\frac{Q^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2}\left[\frac{2}{\epsilon} + 1 - \log\frac{x_B - x}{x_B}\right]\frac{e_q^2}{2}\,\frac{1}{x - x_B}\;\gamma^\mu\,\slashed{n}\,\gamma^\nu . \qquad (2.41)$$

We will see that all of our amplitudes can be expressed in terms of this one simple Dirac structure and its hermitian conjugate.



30 One can see this by examining each diagram separately, keeping the $\epsilon$'s explicitly. Alternatively, a study 
of the amplitude in the complex $x_B$-plane reveals $T$ to be an analytic function of $x_B$ with branch cuts along 
the real axis stemming from $\pm x$. The discontinuity relevant to our physical structure functions can then be 
obtained by approaching the positive real axis from underneath. 

31 This method is identical to that employed in Section 1.6. The presence of a nonzero scale complicates this 
process, so we discuss its application explicitly in this case. 

An application of the formulae in Appendix A.3 to the integrals encountered in the remaining 
diagrams is possible, but extremely tedious. This method of covariant integration 
does not use the special kinematics of our problem. A superior method^{31} breaks the integration 
into $+$, $-$, and $\perp$ components: $d^dk = dk^+\,dk^-\,d^{d-2}k_\perp$. The definitions of $+$ and $-$ 
ensure that the measure is invariant under this rotation. Defining $y = k^+$, we may substitute 
$k^\mu = y\,p^\mu + k^- n^\mu + k_\perp^\mu$ into our expressions. The fact that $n^2 = p^2 = 0$ and none of our 
external momenta have transverse components leads to a tremendous simplification. In particular, 
the large $k^-$ behavior of the integral is never worse than $\int dk^-/(k^-)^2$. This sets the 
stage for a contour integration in the $k^-$-plane. Restoring the $+i\epsilon$'s, we can analyze the pole 
structure of the integrand. Since $k^-$ is always accompanied by a function of $y$, the position 
of its pole will drop from above the real axis ($y \ll 0$) to below the real axis ($y \gg 0$) as $y$ 
makes its trek from $-\infty$ to $+\infty$. The precise position of the drop point depends on the pole 
in question, but for either very large $y$ or very large $-y$ all of the poles appear either below 
or above. We are free to close the contour in the pole-free half of the plane in these regions, 
giving no contribution to the integral. For intermediate $y$, we may close the contour in either 
the upper- or the lower-half-plane; Cauchy's theorem assures us the result will be identical. 
The transverse dimensions now have an integral of the form^{32} 

$$\int \frac{d^{d-2}k_\perp}{(2\pi)^{d-2}}\, \frac{\alpha Q^2 + \beta k_\perp^2}{(k_\perp^2 + M^2)(k_\perp^2 + N^2)} \ , \tag{2.43}$$

where $\alpha$ and $\beta$ are functions of $x$, $x_B$, and $y$, which can easily be done via the methods of 
Appendix [e]. The final integration can always be done with the formula 

$$\int_0^1 dx\, x^{\alpha-1}(1-x)^{\beta-1} = \frac{\Gamma(\alpha)\Gamma(\beta)}{\Gamma(\alpha+\beta)} \tag{2.44}$$

after a suitable change of variables. 
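As a quick sanity check on the final integration formula, one can compare a direct numerical quadrature of the Euler Beta integral with the Gamma-function expression. The sketch below uses a simple midpoint rule; the function names and the chosen exponents are illustrative assumptions, and the quadrature is only meant for exponents of at least one, where the integrand is bounded.

```python
import math

def beta_integral(a, b, n=200000):
    """Midpoint-rule estimate of int_0^1 t^(a-1) (1-t)^(b-1) dt, for a, b >= 1."""
    h = 1.0 / n
    return h * sum(((i + 0.5) * h) ** (a - 1) * (1.0 - (i + 0.5) * h) ** (b - 1)
                   for i in range(n))

def beta_gamma(a, b):
    """Closed form Gamma(a) Gamma(b) / Gamma(a + b)."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)

# Compare the two evaluations for a few sample exponents.
for a, b in [(2.0, 3.0), (1.5, 2.5), (3.0, 1.0)]:
    print(a, b, beta_integral(a, b), beta_gamma(a, b))
```

In dimensional regularization the exponents are non-integer functions of $\epsilon$, and the closed form is then used as the analytic continuation of the integral.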

This analysis is complicated by the existence of physical intermediate states for $x_B < x$. 
These states lead to branch cuts in our amplitude and generally get in the way. We can avoid 
them during the calculation by working in the unphysical region, $x_B \gg x$. At the end of the 
day, we are free to analytically continue $x_B$ to the physical region of interest. 
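The pole-counting argument used for the $k^-$ integration above can be checked on a toy integrand that falls off like $1/k^2$. The sketch below is not one of the diagram integrals: it takes $1/[(k - z_1)(k - z_2)]$ with hypothetical pole positions $z_1$, $z_2$ and integrates along the real axis using the substitution $k = \tan\theta$. When both poles lie on the same side of the real axis the result vanishes; when they straddle it, the result is $2\pi i$ times the residue of the upper pole.

```python
import math

def line_integral(z1, z2, n=4000):
    """Estimate int_{-inf}^{inf} dk / ((k - z1)(k - z2)) via k = tan(theta)."""
    h = math.pi / n
    total = 0j
    for i in range(n):
        theta = -math.pi / 2 + (i + 0.5) * h
        s, c = math.sin(theta), math.cos(theta)
        # dk = dtheta / cos^2(theta) cancels the cos^2 produced by the two factors
        total += 1.0 / ((s - z1 * c) * (s - z2 * c))
    return total * h

up1, up2, down = 0.3 + 0.5j, -1.2 + 0.8j, -1.2 - 0.8j   # hypothetical pole positions
print(abs(line_integral(up1, up2)))                      # both poles above: no contribution
print(line_integral(up1, down), 2j * math.pi / (up1 - down))  # poles on opposite sides
```

Closing in the lower half-plane for the second case picks up the other pole with the opposite orientation and gives the same value, in line with the statement that either choice of contour is allowed for intermediate $y$.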

32 We can always choose $N^2 = 0$ by refusing to pick up the $Q^2$-dependent pole. In this case, both integrals 
reduce to 

$$\int \frac{d^{d-2}k_\perp}{(2\pi)^{d-2}}\, \frac{1}{k_\perp^2 + M^2} \ . \tag{2.42}$$

33 This conjugation is to be understood as taking place only in the Dirac sector. The $-i\epsilon$ associated with $x_B$ 

The two vertex diagrams (Figs. 2.2c and d) are related by hermitian conjugation^{33} and $\mu\nu$ 
exchange, as can be seen directly from their expressions. This symmetry leaves us with only 
two diagrams to calculate explicitly. Including both contributions, the vertex gives 



C( s Cf 2 ( Q e 



2 P 7E\ " e / 2 



1 



1 



Xb — x \e 



x xb — x 



4 . x B - x 
- + 3 - log 

e x B 



(2.45) 



1 



log 



xb — x 
xb 



xb — x 

The presence of a squared propagator in the final diagram (Fig. 2.2e) complicates matters 
slightly. However, since there are still only three poles, our method can easily be modified to 
handle this situation. We obtain 



a s CF 2 I Q e 



2„7E \ " e / 2 



4vr/i^ 
(7'V7"+7'W) 



2 1 xb — x 
e 2 x B 



(2.46) 



+ 2(7W~7W) 



ig-i, xb — x 
— log 

x A xb 



All of our diagrams contain infrared divergences^{34} which do not cancel in the full amplitude, 



ol s Cf 2 ( Q e 
-7^7" 
x B + x 



x' xb — x 

f xb — x 1 

\ x 2 xb — x 



+ - -(VW +7^7") 
e x 



x z xb 



4 x B - x 
- + 3 - log 

e x B 



log — -7"^ 7" (2.47) 

xb 



as promised in the beginning of this chapter. At this point, we are free to project out the 

does not change sign. Since the final state wavefunction comes with a $\gamma^0$, $\bar{u} = u^\dagger\gamma^0$, the conjugation simply 
reverses the order of the $\gamma$-matrices. 

34 Strictly speaking, the self-energy diagram we calculated first does not contain any infrared divergences. 
It contains only ultraviolet divergences which are destined to cancel with counterterms. However, since we 
allowed the ultraviolet divergences in our counterterms to cancel the infrared divergences associated with the 
LSZ reduction formula, these ultraviolet divergences have been transmuted into infrared ones. 
Figure 2.3: Leading diagrams for gluon-virtual photon scattering, panels (a) and (b). 



structure functions as above. We obtain 

(2.48) 

for the symmetric part and 

Sf(x,x B ) = - 

(2.49) 

for the antisymmetric. In these expressions, the crossing contribution has been included explicitly. 

At this order in QCD, virtual photons can also scatter off gluons in our target via the 
virtual quark loop diagrams in Figure 2.3. These diagrams must be ultraviolet finite since 
there are no counterterms at this order. The two gluons have transverse polarizations^{35} $i$ and 



35 As explained in Appendix B.2, massless particles are classified by their helicity. For vector objects, like 
the gluon, the form of the rotation generators implies that helicity eigenstates are obtained only for transverse 
polarization. 
$j$. Once again, there are polarized and unpolarized contributions to the amplitudes. These can 
be obtained in exactly the same way as for the quarks, by performing the integration first and 
projecting out the symmetric and antisymmetric parts of the amplitude later. When averaging 
over the gluon polarizations in $d$ dimensions, we divide by $d-2$ rather than 2 because this is 
the number of polarization states available to a gluon in $d$ dimensions. The result is 



Tf(x,x B ) = 
e/2 
(2.50) 



for the symmetric structure and^{36} 



Sf(x,x B ) = 



a sT f 
2vr 



£' 



2 P 7-B 



Q 2 e 



4tt^ 



-6/2 



1 + 2 £*Z£) f! +4 _ log ^^ | _ 2 
x J \ e xb 



log 



xb — x 



xb 



{x b — > -a;^) 



(2.51) 



for the antisymmetric. Once again, we see infrared divergences. These divergences are intimately 
related to those in the quark sector, as we will see in the next section. 



2.3 Infrared Divergences and Parton Distributions 

As we have seen, DIS on onshell massless quark and gluon targets contains infrared divergences 
which do not cancel in 'physical' amplitudes. One might say, "Who cares? These amplitudes 
are not physical anyway, so it's not important that they are unavoidably divergent." However, 
the physical arguments of the last section indicate that these amplitudes contribute to physical 
scattering processes involving hadrons. The relation between physical and partonic amplitudes 
involves probability functions which are certainly not expected to diverge. Furthermore, since 
the non-abelian nature of QCD contributes only trivially to the calculations of the last section, 
there is no way that our calculation can 'know' that the process isn't physical. In fact, this 

36 For the antisymmetric structure, the easiest way to get the result is by explicitly taking the photon and 
gluon indices in the 1- and 2-directions and interpreting the result as multiplied by iAe _+12 . 






problem also exists in QED where the process is physical. How, then, can we make sense of 
these divergences? 

In QED, one argues that the quantity we have calculated is not really physical. If we 
were to consider an actual experiment, we would have no way to distinguish an electron of 
momentum k from an electron-photon system of momentum k in which the photon carried 
either a very small (soft) momentum or moved collinear to the electron. These two situations 
are considered separate in the calculation. If one wants to compare with experiment, one 
should add the cross-section for these processes to this one. It is from precisely these regions 
of momentum space that the infrared divergences come.^{37} 

Since infrared divergences come from regions of soft or collinear momentum, they are 
associated mainly with long time scales. The arguments of the last section then imply that these 
regions are independent of the hard scattering. They correspond to processes in which, for 
example, a quark splits into another quark and a collinear gluon long before the scattering. 
In effect, it is only the second quark which participates in the hard scattering.^{38} This fact is 
in direct conflict with the definition of $t_a^{\mu\nu}$. These quantities were defined as that part of the 
high-energy scattering which involves a parton of type $a$. The above discussion implies that 
the object we calculated contains a piece in which the parton relevant to the scattering is of a 
different type. Apparently, this new type of parton can be found inside our old parton with a 
certain probability. This is reminiscent of the discussion of the last section. There, we considered 
the target as a loosely (over the relevant time scale) bound collection of partons and discovered 
that the scattering takes place with each parton incoherently. Here, we see that by probing at 
a large scale the partons themselves can break up into other partons. This suggests that we 

37 It is also precisely these regions of momentum space that one cannot properly take into account in QCD 
perturbation theory. The $\beta$-function of QCD implies that the coupling will grow as $\mu^2$ is decreased until it 
no longer makes sense to use perturbation theory. Of course, at that point we don't know precisely what 
the behavior of the $\beta$-function is because we cannot calculate it. However, one thing is sure - in the regions 
associated with the infrared divergences of the last section the dynamics of QCD is nonperturbative. This fact 
will be addressed below. 

38 This analysis is complicated by diagrams like Fig. 2.2c. However, as we will see below, its inclusion does 
little more than ensure the gauge-invariance of the distributions $f_{a/T}$. 
consider our partons as targets in the sense of Equation (2.18): 

$$T_a^{\mu\nu} = \sum_{a'} f_{a'/a}\, t_{a'}^{\mu\nu} \ . \tag{2.52}$$

In the last section, we calculated the left-hand-side of this equation rather than the coefficients 
$t_a^{\mu\nu}$ on the right-hand-side. The nature of these coefficients implies that they cannot be 
calculated directly. Our procedure for perturbative calculations necessarily involves integration 
over all portions of momentum space, which will lead unavoidably to contributions from 
low-energy. The functions $f_{a'/a}$, on the other hand, involve no explicit momentum cutoff. It 
is merely expected that they do not receive corrections from asymptotically large momentum 
scales. Unfortunately, the regions in which they are expected to be generated are nonperturbative 
in nature. We have already included contributions from these regions in $T_a^{\mu\nu}$. 

Although these contributions have been calculated incorrectly, the method we used can 
be extended consistently to incorrect perturbative calculations of other quantities, $f_{a'/a}$ in 
particular. Since the incorrect contributions are calculated in the same way, we can use 
Equation (2.52) to cancel them out and extract correct expressions for $t_a^{\mu\nu}$ order by order in 
perturbation theory. It is these quantities (rather, the imaginary parts of their form factors) 
which appear in the physical cross-section and will be free of divergences. A formal proof of this 
statement involves a complete classification of the regions which generate infrared divergences 
in DIS and an understanding of the relationship between these regions and the regions which 
generate the parton distributions. An outline of such a proof appears in the next section. For 
now, let us see explicitly how it works at one-loop order. Along the way, we will learn a great 
deal about the parton distributions. 
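Before turning to the explicit calculation, it may help to sketch how the cancellation is organized at first order. The following is schematic and rests on assumptions that are not taken from the text: the product in Equation (2.52) is written as a convolution $\otimes$ in the momentum fraction, every quantity is expanded in powers of $\alpha_s$ with superscripts $(0)$ and $(1)$ labelling the order, and at lowest order a parton is taken to contain only itself.

```latex
\begin{align*}
T_a^{\mu\nu} &= T_a^{(0)\mu\nu} + \alpha_s\, T_a^{(1)\mu\nu} + \mathcal{O}(\alpha_s^2) , \\
f_{a'/a} &= \delta_{a'a}\,\delta(1-x) + \alpha_s\, f^{(1)}_{a'/a} + \mathcal{O}(\alpha_s^2) , \\
t_a^{\mu\nu} &= t_a^{(0)\mu\nu} + \alpha_s\, t_a^{(1)\mu\nu} + \mathcal{O}(\alpha_s^2) , \\[4pt]
\mathcal{O}(\alpha_s^0):\quad t_a^{(0)\mu\nu} &= T_a^{(0)\mu\nu} , \\
\mathcal{O}(\alpha_s^1):\quad t_a^{(1)\mu\nu} &= T_a^{(1)\mu\nu}
   - \sum_{a'} f^{(1)}_{a'/a} \otimes t_{a'}^{(0)\mu\nu} .
\end{align*}
```

In this bookkeeping the infrared poles of $T_a^{(1)\mu\nu}$ are expected to be matched by those of $f^{(1)}_{a'/a}$ convoluted with the tree-level coefficient, leaving $t_a^{(1)\mu\nu}$ finite.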

In order to calculate parton distribution functions in perturbation theory, we must write 
them as matrix elements. The probability of finding a quark of flavor $q'$ and momentum $xp$ 
in a quark state of flavor $q$ and momentum $p$ is the same as the probability that I can remove 
a quark of flavor $q'$ and momentum $xp$^{39} from a quark state of flavor $q$ and momentum $p$ and 



39 The final expression we will use to define $f_{q'h'/qh}(x)$ actually receives contributions from all partons whose 
momentum $k$ satisfies $n\cdot k = x$. The other components of $k$ give subleading contributions to the hard scattering. 



These kinds of corrections are discussed in 



Chapter |i[ 
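be left with a physical^{40} state: 

$$f_{q'h'/qh}(x) \sim \sum_X \langle ph|\psi^\dagger_{q'h'}(0)|X\rangle \langle X|\psi_{q'h'}(0)|ph\rangle\, \delta(1 - x - n\cdot p_X)$$
$$\qquad = \int \frac{d\lambda}{2\pi}\, e^{-i\lambda x}\, \langle ph|\psi^\dagger_{q'h'}(\lambda n)\,\psi_{q'h'}(0)|ph\rangle \ . \tag{2.53}$$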

This expression can be viewed as an impressionistic picture of the distribution functions at best 
since its derivation completely ignores the requirement of gauge invariance and the presence 
of spinor indices on the quark fields. The latter complaint concerns dependence on $h'$.^{41} Since 
we wish to consider both values, we will leave the spinor indices free for now, but multiply $\psi^\dagger$ 
by $\gamma^0$ to form a true Dirac index. Achieving gauge invariance is also quite simple. According 
to Appendix our distribution is in need of a gauge link between 0 and $\lambda n$. The path the 
gauge link traverses should be along $n^\mu$ to minimize its dependence on $A$. In this way, we 
arrive at the expression 

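$$\left(\mathcal{M}_{q'/q}\right)_{\alpha\beta}(x) = \int \frac{d\lambda}{2\pi}\, e^{-i\lambda x}\, \langle ph|\bar\psi_{q'\beta}(\lambda n)\, \mathcal{G}_n(\lambda n, 0)\, \psi_{q'\alpha}(0)|ph\rangle \ . \tag{2.54}$$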



Although $\mathcal{M}$ is the minimal gauge invariant extension of (2.53), its interpretation is quite 
different. The first manipulation changes it from a probability to a density matrix. This allows 
us to consider both helicity states simultaneously, much as considering $f_{a'/a}$ as a function of 
$x$ allows us to combine a continuously infinite number of probabilities into one probability 
function. Our density matrix can readily be split into two probability functions with the help 
of some projection operators. However, the explicit form of the gauge link 

$$\mathcal{G}_n(\lambda n, 0) = P\, \exp\left\{-ig\int_0^\lambda n\cdot A(\zeta n)\, d\zeta\right\} \tag{2.55}$$



reveals that the second manipulation has changed the nature of our simple distribution. Rather 
than the removal and subsequent replacement of a quark, $\mathcal{M}$ describes a correlation between 
quark and gluon fields. This complication is unavoidable in gauge theories because quarks 
do not make sense by themselves. The fact that a quark field transforms under a gauge 
transformation implies that 'the probability of finding a quark' is itself not a gauge invariant 

40 In calculations with partonic external states, the word 'physical' is used to mean physical in the absence 
of confinement. I.e. a transversely polarized onshell gluon is a 'physical' state while an offshell quark is not. 
Since ghosts are nothing but mathematical artifacts, they cannot be present in 'physical' states. 

41 Parity invariance implies that we can take h = +1/2 without loss of generality. 
concept. However, once we choose a gauge this statement has meaning. In particular, the 
gauge restriction $n\cdot A = 0$ makes our gauge link trivial. In this gauge, we are free to interpret 
$\mathcal{M}$ as a true quark density matrix. Unfortunately, the difficulties associated with this singular 
gauge encourage us to make a less controversial choice. Nevertheless, its existence has taught 
us that the gluons appearing in the gauge link are unphysical in the sense that they can be 
removed by a gauge rotation.^{42} Since they are unphysical, we can view them as gauge artifacts 
and continue to interpret $\mathcal{M}$ as a quark density matrix. 
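The statement that the link becomes trivial in the gauge $n\cdot A = 0$ can be seen by expanding the path-ordered exponential. The sketch below takes the link to be the path-ordered exponential of $-ig\int n\cdot A$ along $n$, as in Equation (2.55), and assumes a convention in which fields at larger $\zeta$ stand to the left; the text does not spell out this ordering, so it should be read as an illustrative choice.

```latex
\begin{align*}
\mathcal{G}_n(\lambda n, 0)
 &= 1 - ig\int_0^\lambda d\zeta\; n\cdot A(\zeta n)
   + (-ig)^2 \int_0^\lambda d\zeta_1 \int_0^{\zeta_1} d\zeta_2\;
     n\cdot A(\zeta_1 n)\, n\cdot A(\zeta_2 n) + \mathcal{O}(g^3) , \\
n\cdot A = 0 \quad &\Longrightarrow \quad \mathcal{G}_n(\lambda n, 0) = 1 .
\end{align*}
```

Every term beyond the first contains at least one factor of $n\cdot A$ evaluated on the path, so only this single component of the gauge field ever enters the distribution.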

The gluon fields in the gauge link are evaluated at space-time points between the two 
quark fields. These contributions are generated in DIS by diagrams like Fig. 2.2c in which a 
gluon interacts with the quark during the scattering. Although the arguments above indicate 
that there is not enough time for this to occur, the inclusion of this graph is required by gauge 
invariance. Since all of our arguments concerning factorization have depended critically on 
the idea that no low-energy QCD interactions can happen during the scattering, this process 
could potentially be very dangerous. The offensive contribution is of the form 

$$\gamma^\mu\, \frac{\not{p} + \not{q}}{(p+q)^2}\, \gamma^\alpha\, \frac{y\not{p} + \not{q}}{(yp+q)^2}\, \gamma^\nu \tag{2.56}$$

in the collinear region, where the gluon has momentum $(1-y)p$. In the Bjorken limit, the 
leading contribution has Dirac structure $\gamma^\mu \not{n} \gamma^\alpha \not{n} \gamma^\nu$. This structure vanishes unless $\alpha = -$, so 
leading behavior is associated with the field $A^+ = n\cdot A$.^{43} This implies that our troublesome 
graph does not contribute to the leading behavior in light-cone gauge. Furthermore, since the 
gluon momentum is proportional to $p^\mu$, the leading region is equivalent to one in which the 
gluon field has been replaced by its momentum. Writing 

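$$\gamma^\mu\, \frac{\not{p} + \not{q}}{(p+q)^2}\, (1-y)\not{p}\, \frac{y\not{p} + \not{q}}{(yp+q)^2}\, \gamma^\nu = \gamma^\mu\, \frac{y\not{p} + \not{q}}{(yp+q)^2}\, \gamma^\nu - \gamma^\mu\, \frac{\not{p} + \not{q}}{(p+q)^2}\, \gamma^\nu \tag{2.57}$$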

it becomes obvious that this gluon insertion does not constitute a nontrivial contribution to 
the scattering in the infrared region. In reality, this graph simply represents a combination 
of scattering off of a quark of momentum yp (just the quark) and a quark of momentum p 
(coherent scattering off of the quark and the gluon combined). We will see in the next section 

42 This fact is critically dependent on the choice of path implicit in the definition of the gauge link. Any 
other choice will necessarily introduce physical gluons, destroying the desired interpretation of $\mathcal{M}$. 

43 Remember that $\gamma^\alpha$ came from the interaction $\not{A} = A^+\gamma^- + A^-\gamma^+ - A_\perp\cdot\gamma_\perp$. 

Figure 2.4: Feynman rules for eikonal line propagation. 

that this behavior generalizes to any number of gluons to all orders in $\alpha_s$ as a consequence of 
current conservation in QCD. Returning to the parton distribution, we see that the gluon fields 
in the gauge link are generated by graphs like these. They are required by gauge invariance, 
but the same principle also forbids them from having a nontrivial effect on the hard scattering. 
Nonetheless, they are present and must be taken into account in any nonsingular gauge. 
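The algebra behind the decomposition in Equation (2.57) is short enough to display. The sketch below uses only $\not{k}\not{k} = k^2$ and reads the quark propagators in (2.56) as carrying momenta $p+q$ and $yp+q$; this arrangement of factors is an assumption about the garbled original rather than something confirmed by it.

```latex
\begin{align*}
(1-y)\not{p} &= (\not{p}+\not{q}) - (y\not{p}+\not{q}) , \\
(\not{p}+\not{q})\,(1-y)\not{p}\,(y\not{p}+\not{q})
 &= (p+q)^2\,(y\not{p}+\not{q}) - (yp+q)^2\,(\not{p}+\not{q}) , \\
\frac{(\not{p}+\not{q})\,(1-y)\not{p}\,(y\not{p}+\not{q})}{(p+q)^2\,(yp+q)^2}
 &= \frac{y\not{p}+\not{q}}{(yp+q)^2} - \frac{\not{p}+\not{q}}{(p+q)^2} .
\end{align*}
```

Replacing the gluon field by its momentum therefore cancels one propagator at a time, which is the familiar pattern of a Ward identity: the two terms are tree-level scattering off a quark of momentum $yp$ and off a quark of momentum $p$.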

Now that we have unambiguously defined what we mean by a parton distribution function, 
we would like to calculate it in perturbation theory. Once again, we find ourselves in an 
awkward situation because of the lack of a time-ordered product. The trick we used in the last 
section cannot be applied here because the term we would like to add is not strictly zero. More 
importantly, the gauge link will not permit this sleight-of-hand. However, we can massage this 
quantity into a more useful form by writing 



*(An) 



<?„(oo, Xn)ijj(Xn) 



Z7T 



*J (An) (0) 



ph 



(2.58) 



dX 



— iX(x+n-px— 1) 'y ' 
X 



ph 



K (o) 



X) X 



n (°) 



ph 



The fields $\Psi$ are gauge invariant in the sense that gauge transformations at infinity can be 
taken as trivial in perturbation theory.^{44} Since the 0-component of $n^\mu$ is positive, the path-ordering 
of the exponential is equivalent to a time-ordering operation in the matrix element 
on the right. This exponential represents a coherent sum of gluon fields propagating from 0 to 
$\infty$. Coherent propagation of this kind is known as an eikonal line and plays an important role 
in proofs of factorization. The eikonal propagation itself is represented in Feynman diagrams 
as a double line. The Feynman rules for its propagation and interaction with gluon fields are 
shown in Figure 2.4. 

44 The nontrivial topology of our gauge group will cause problems here if we intend to do a calculation outside 
of perturbation theory. 

Figure 2.5: Contributions to the 'quark distribution in a quark' at (a) leading and (b)-(d) 
next-to-leading order in QCD. The final state in (a) and (b) is the vacuum, while (c) and (d) 
represent gluonic external states. 

The calculation of $\mathcal{M}$ is now straightforward: we simply draw all Feynman diagrams 
corresponding to the matrix element on the right for a given final state, sum their contributions, 
multiply by the complex conjugate^{45}, and sum over final states. For quark external states, the 
necessary diagrams up to order $\alpha_s$ are shown in Figure 2.5. At this order there are only two 
final states - the vacuum and the one gluon state. The sum over gluon final states involves 
an integration^{46} over onshell positive-energy momenta and a sum over polarization states. 
This integration is greatly simplified by the $\delta$-function constraint on the gluon's $+$ momentum 
imposed by the $\lambda$ integration. The polarization sum in Feynman's gauge is given by $-g^{\mu\nu}$. 

The only subtlety left is that all corrections to this object seem to be exactly zero in 
dimensional regularization due to the lack of scale. As discussed in Appendix || this zero 
represents a cancelation between infrared and ultraviolet divergences. However, we really wish 
to consider a renormalized matrix element defined at a certain scale $\mu^2$ to be free of ultraviolet 
divergences.^{47} This object is formally defined through its moments, which will be introduced 
in Section 2.5. The local operators they are related to can be renormalized with a Z-factor 
as with the quark-gluon current ijj fiip in the last chapter. However, at this order we can 
renormalize simply by subtracting the ultraviolet divergence in a particular scheme. 
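The cancelation between infrared and ultraviolet divergences in a scaleless integral can be made explicit on a prototype. The sketch below is not one of the integrals of this section: it assumes $d = 4 - \epsilon$, as suggested by the factors of $2/\epsilon$ above, writes $t$ for $k_\perp^2$, and introduces an arbitrary splitting scale $\Lambda^2$ purely for illustration.

```latex
\begin{align*}
\int_0^\infty dt\; t^{-1-\epsilon/2}
 &= \underbrace{\int_0^{\Lambda^2} dt\; t^{-1-\epsilon/2}}_{\text{infrared, converges for } \epsilon<0}
  + \underbrace{\int_{\Lambda^2}^\infty dt\; t^{-1-\epsilon/2}}_{\text{ultraviolet, converges for } \epsilon>0} \\
 &= -\frac{2}{\epsilon}\,(\Lambda^2)^{-\epsilon/2} + \frac{2}{\epsilon}\,(\Lambda^2)^{-\epsilon/2} = 0 .
\end{align*}
```

Each piece is evaluated where it converges and then continued to a common value of $\epsilon$. Subtracting only the ultraviolet pole leaves the infrared pole behind, which is the sense in which the renormalized distribution is infrared divergent.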

To isolate the ultraviolet part of the divergence, we regularize the infrared by taking $p^2$ 

45 To be explicit, I should mention that the complex conjugate must be multiplied by $\gamma^0$ from the right. 

46 in d dimensions 

47 Strictly speaking, we cannot take $p^2 \to 0$ until we renormalize our matrix elements because this procedure 
leads to unregularized infrared divergences. Until our distribution is ultraviolet finite, we need $\epsilon > 0$ to 
regularize ultraviolet divergences. In this region, $\lim_{p^2\to 0}(p^2)^{-\epsilon/2}$ is certainly not zero. However, once the 
ultraviolet divergences have been removed, we are free to take $\epsilon < 0$ and $p^2 \to 0$ without ambiguity. 
vanishingly small but finite.^{48} This has the effect of postponing the cancelation of infrared 
and ultraviolet divergences until the ultraviolet divergence can be identified and removed. 
In addition to the direct subtraction, division by $Z_F$ is necessary to properly normalize the 
external quark states.^{49} In the MS scheme, the full result is^{50} 

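$$\mathcal{M}_{q'/q}(x) = \left\{ \delta(1-x) - \frac{\alpha_s C_F}{2\pi}\, \frac{2}{\epsilon} \left(\frac{\mu^2 e^{\gamma_E}}{4\pi\mu_0^2}\right)^{-\epsilon/2} \left[ \frac{1+x^2}{1-x} - \delta(1-x)\int_0^1 dy\, \frac{1+y^2}{1-y} \right] + \mathcal{O}(\alpha_s^2) \right\} \delta_{qq'}\, u_q(p,h)\,\bar{u}_q(p,h) \ . \tag{2.59}$$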



Since $u(p,h)\,\bar{u}(p,h) = (1 + 2h\gamma_5)\,\not{p}/2$, we can split the density matrix into two scalar pieces: 

1 



(M q > h >/qh) a/3 
fq>/q{x) 



= *fq'/q( X ) + fq'/q( X ) (7sMa/3 



2 

1 r dx 



—iXa. 



2tt 



*(ph 



r (Xn)G n (Xn,0)i^ (0) 

e/2 



ph 



§{1 _ x) _ 0*0*2 

2lT € \ 



2tt e \ 4:TT[1q 



i + y_[ 

y 



fq>/q(x) 





fd_X 


-i\x 




2?r 




<W < 


6(1- 


x) 



/ (Xn)g n (Xn,0)rli> q ' (0) 

e/2 



+ 0(a 2 s ) 
ph. 



,C F 2 ftf 



2n e \ 47t^q 



1 + x 2 , f 1 , 1 + y 2 



l-x 



6(1- x) / dy- 

Jo 1 -y 



(D(aj 



(2.60) 
(2.61) 



(2.62) 
(2.63) 



(2.64) 



As mentioned above, the helicity-dependent scalar distributions can be projected out of 



48 This amounts to neglecting it everywhere except in the denominators. 

49 In renormalized perturbation theory, this factor is required to renormalize the external states; in unrenormalized 
perturbation theory, it renormalizes the quark fields. 

50 The divergent integral here will be discussed below. For now, we just treat it with the love and care it 
requires as an ill-defined mathematical object. 
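$\mathcal{M}$. Taking the target helicity positive, we have^{51} 

$$f_{q'\pm/q+}(x) = \frac{1}{2}\,{\rm Tr}\left[\frac{1}{2}(1\pm\gamma_5)\,\mathcal{M}\,\not{n}\right] = \frac{1}{2}\left[f_{q'/q}(x) \pm \tilde{f}_{q'/q}(x)\right] \tag{2.65}$$

$$f_{q'/q}(x) = f_{q'+/q+} + f_{q'-/q+} \ , \tag{2.66}$$

$$\tilde{f}_{q'/q}(x) = f_{q'+/q+} - f_{q'-/q+} \ . \tag{2.67}$$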



Hence $f_{q'/q}(x)$ represents the probability that a quark of flavor $q'$ and momentum $xp$ will 
be found in a quark of flavor $q$ and momentum $p$ regardless of its polarization, and $\tilde{f}_{q'/q}(x)$ 
represents the difference between the same helicity distribution and the opposite helicity 
distribution. Our result above indicates that at one-loop order the probability of finding a quark 
with opposite helicity is zero. This is in agreement with the fact that QCD is a vector theory, 
and as such does not change the helicity of its constituents. However, higher order radiative 
corrections can generate opposite helicity fermions through loops. This simplification is only 
an artifact of the low order at which we work. 

Since QCD dynamics are independent of flavor,^{52} quarks can only be generated in flavor-singlet 
pairs. Every contribution to $f_{q'/q}(x)$ must be countered by an equal contribution to 
an antiquark distribution function $f_{\bar{q}'/q}(y)$. This suggests that the integral of the difference 
between quark and antiquark probability distributions for any given flavor cannot be altered 
by QCD dynamics. This is, in fact, true. The integral 



is completely independent of QCD dynamics and returns an integer, the net number of quarks 
of flavor q, for every target T. Of course, in order to have a rigorous statement, we must first 
define the antiquark distribution. Writing^{53} 



51 The $\not{n}$ here appears simply because it allows us to project out a scalar distribution involving single-helicity 
fields. Alternatively, one can simply multiply $\mathcal{M}$ by $(1 \pm \gamma_5)/2$ on the left and work out the contributing 
functions. 

52 assuming, as always, that all quarks are massless 

53 Note the position of the gauge link. Its indices are contracted as before. 




(2.68) 




(2.69) 
-Jq'/q(x) (lO aja + fq'/q( X ) {l^) af} 



(2.70) 



in analogy with Eqs. (2.54) and (2.60), we see that this density matrix contains the antiquark 
probability distributions 



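$$f_{\bar{q}'\pm/q+}(x) = \frac{1}{2}\,{\rm Tr}\left[\frac{1}{2}(1\pm\gamma_5)\,\bar{\mathcal{M}}\,\not{n}\right] = \frac{1}{2}\left[f_{\bar{q}'/q}(x) \pm \tilde{f}_{\bar{q}'/q}(x)\right] \tag{2.71}$$

$$f_{\bar{q}'/q}(x) = f_{\bar{q}'+/q+} + f_{\bar{q}'-/q+} \tag{2.72}$$

$$\tilde{f}_{\bar{q}'/q}(x) = f_{\bar{q}'+/q+} - f_{\bar{q}'-/q+} \tag{2.73}$$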



Here, the helicity label does not correspond to the helicity of the antiquark itself. Rather, it 
refers to the helicity of the quark related to the antiquark. For example, $f_{\bar{q}'+/q+}(x)$ represents 
the probability of finding an antiquark of flavor $q'$, momentum $xp$, and negative helicity in a 
quark of flavor $q$ with momentum $p$ and positive helicity. 

Since quarks and antiquarks are so closely related, it is customary to combine their contributions 
into the same distribution. In light of Equation (2.69), the relationship 



M a p(-x) M a p{x) 



(2.74) 



is valid in the singular axial gauge, $n\cdot A = 0$, provided that we ignore the anticommutator.^{54} 
Since we have not at this point attributed any meaning to the functions $f_{q'/q}(x)$ on the negative 
real axis, we can define 



$$f_{q/T}(-x) = -f_{\bar{q}/T}(x)$$

$$\tilde{f}_{q/T}(-x) = -\tilde{f}_{\bar{q}/T}(x) \tag{2.75}$$



for $x > 0$. This is not in any way a statement about the analytic properties of $f_{q/T}(x)$.^{55} 
It is merely a convention to simplify the appearance of our equations and acknowledge the 
similarity between quarks and antiquarks. Using this notation, Eq. (2.68) can be written in 
the compact form 



$$\int_{-1}^{1} dx\, f_{q/T}(x) = N_{q/T} \tag{2.76}$$
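This sum rule can be checked against the one-loop quark distribution in a quark. Reading the $x$ dependence of the one-loop term in Equation (2.59) as $(1+x^2)/(1-x) - \delta(1-x)\int_0^1 dy\,(1+y^2)/(1-y)$, its integral against a smooth test function $g(x)$ is $\int_0^1 dx\,\frac{1+x^2}{1-x}\,[g(x) - g(1)]$. The sketch below evaluates this with a midpoint rule; the function name and the test functions are illustrative assumptions.

```python
def plus_moment(g, n=200000):
    """Midpoint estimate of int_0^1 dx (1 + x^2)/(1 - x) * (g(x) - g(1))."""
    h = 1.0 / n
    g1 = g(1.0)
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        total += (1.0 + x * x) / (1.0 - x) * (g(x) - g1)
    return total * h

# g = 1 corresponds to the number sum rule: the one-loop term integrates to zero.
print(plus_moment(lambda x: 1.0))
# g = x weights by momentum fraction: the quark loses momentum to the radiated gluon.
print(plus_moment(lambda x: x))        # close to -4/3
print(plus_moment(lambda x: x * x))    # close to -25/12
```

The vanishing result for $g = 1$ shows that the divergent subtraction term is exactly what keeps the net quark number fixed, while the nonzero higher moments carry the genuine dynamics.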



54 It can be shown that this term vanishes in a particular quantization scheme. See [52]. 

55 The operators which are able to generate connected contributions are quite different in the two cases. 
The vector nature of QCD allows one to construct the sum rule 

$$\int_{-1}^{1} dx\, f_{q\pm/T}(x) = N^{\pm}_{q/T} \tag{2.77}$$



using arguments identical to those that led to (2.76). These new quantities can be used to 
define the net number of positive and negative helicity quarks of each flavor in the target T. 
According to (2.65), this implies 

$$N^+_{q/T} + N^-_{q/T} = \int_{-1}^{1} dx\, f_{q/T}(x) = N_{q/T} \tag{2.78}$$

$$N^+_{q/T} - N^-_{q/T} = \int_{-1}^{1} dx\, \tilde{f}_{q/T}(x) = \Delta N_{q/T} \ . \tag{2.79}$$

As mentioned above, $N_{q/T}$ is conserved separately for each flavor. This fact follows from the 
result^{56} 

$$\partial_\mu \left\langle T\left(\bar\psi_q(x)\,\gamma^\mu\,\psi_q(x)\right)\right\rangle = 0 \tag{2.80}$$

for physical matrix elements. 

The analogous relation for $\Delta N_{q/T}$ is broken by quantum effects [53]. However, these effects 
are the same for each species of quark. This makes the difference $\Delta N_{q'/T} - \Delta N_{q/T}$ independent 
of QCD dynamics. Systematically, one can define the nonsinglet distributions 

$$f^i_{NS/T}(x) = \sum_{j=1}^{i} f_{j/T}(x) - i\, f_{i+1/T}(x) \tag{2.81}$$

$$\tilde{f}^i_{NS/T}(x) = \sum_{j=1}^{i} \tilde{f}_{j/T}(x) - i\, \tilde{f}_{i+1/T}(x) \ . \tag{2.82}$$

In a theory of $n_f$ approximately massless quarks, the $n_f - 1$ nonsinglet distributions represent 
purely quark effects. These distributions are immune to the dynamical anomaly that alters 
$\Delta N_{q/T}$, so the objects 

$$\Delta N^i_{NS/T} = \int_{-1}^{1} dx\, \tilde{f}^i_{NS/T}(x) \tag{2.83}$$

are physical quantities which are conserved to all orders of perturbation theory.^{57} 

56 This identity is proved in Appendix D.2; it follows from QED gauge invariance. 

57 This still doesn't put them on the same level as the $N_{q/T}$. These quantities are conserved separately 
regardless of quark mass effects, while conservation of the nonsinglet $\Delta N$'s relies heavily on the assumption 
of massless quarks. Even $N_{q/T}$ is altered by the flavor-changing weak interactions. Only the singlet quantity 
$N_{S/T}$ corresponding to the total number of quarks in the target remains inviolate in the full theory. 

The orthogonal degree of freedom is composed of the sum of all flavors, 

$$f_{S/T}(x) = \sum_{j=1}^{n_f} f_{j/T}(x) \tag{2.84}$$

$$\tilde{f}_{S/T}(x) \equiv \sum_{j=1}^{n_f} \tilde{f}_{j/T}(x) \ . \tag{2.85}$$

This singlet^{58} distribution is not immune to the effects of the dynamical anomaly that alters 
$\Delta N$. This implies $\Delta N_{S/T}$, which represents the difference between the number of positive and 
negative helicity quarks irrespective of flavor in the target, is dynamically dependent. This 
manifests itself in a dependence on renormalization scale. Since it is not loyal to any particular 
flavor, the effects of this distribution cannot be distinguished from those of the gluon field. As 
we will see, this fact is intimately related to the presence of infrared divergences in the gluon 
amplitude. 
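The different behavior of the nonsinglet and singlet combinations under a flavor-independent effect can be illustrated with a few lines of arithmetic. The sketch below builds the combinations $\sum_{j\le i} v_j - i\,v_{i+1}$ and the flavor sum from a list of per-flavor numbers; the sample values and the size of the common shift are hypothetical and carry no physical meaning.

```python
def nonsinglet(values):
    """Return the n_f - 1 combinations sum_{j<=i} v_j - i * v_{i+1}, i = 1..n_f-1."""
    return [sum(values[:i]) - i * values[i] for i in range(1, len(values))]

def singlet(values):
    """Return the flavor sum."""
    return sum(values)

delta_n = [0.8, -0.4, -0.1]              # hypothetical Delta N for three flavors
shift = 0.05                             # hypothetical flavor-independent change
shifted = [v + shift for v in delta_n]

print(nonsinglet(delta_n), nonsinglet(shifted))   # unchanged up to rounding
print(singlet(delta_n), singlet(shifted))         # differ by n_f * shift
```

Any change that is common to all flavors drops out of each nonsinglet combination because its coefficients sum to zero, whereas the singlet accumulates it once per flavor.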

The soft physics relevant to gluon contributions to DIS comes in two different forms. On 
one hand, gluons can fluctuate into quark-antiquark pairs which then participate in the hard 
scattering. In this case, the fluctuation occurs long before the hard scattering and cannot 
truly be accounted for perturbatively. This leads to a term of the form $f_{q/g}\,t_q^{\mu\nu}$ in $T_g^{\mu\nu}$. 
The distributions relevant to this contribution are contained in Equation (2.54) and its 
anti-counterpart (2.69), with gluonic external states. Defining 



M q/gv (x) = \f q/g {x) ^+\~f q/g {x) 1 ^) (2.86) 
M q/gn (x) = \f- q/g {x) i>+\f- q/g (x) 1 ^) , (2.87) 



where $\eta = \pm 1$ is the gluon helicity, we have



/*/.(*) = />/»(*) < 2 - 90 ) 



58 'Singlet' and 'nonsinglet' refer to the transformation properties under the flavor group, $SU(n_f)$. Since the
gluon fields are invariant under the action of this group and the nonsinglet distributions are not, these two
sectors have no business with each other. The singlet distribution, on the other hand...

59 Note the extra factor of 1/2 accompanying $\tilde f$. This compensates for the difference in spin between gluon
and quark.






h/g(x) = -f q / g (z) ■ (2.91) 

On the other hand, gluons can enjoy their own hard scattering coefficient. Here, the
fluctuation into a quark-antiquark pair occurs on the same time scale as the hard interaction,
creating an effective photon-gluon coupling. This contribution will have the form $f_{g/g}\, t_g^{\mu\nu}$. The
parton distribution which appears here must be expressed as a matrix element of a gluonic
operator. The road from idea to matrix element is complicated in this case by the nonlinear
transformation law for the gluonic fields. Here, we need somewhat more than a gauge link
to form a gauge invariant distribution. In fact, the only local object which annihilates the
one-gluon state and has linear gauge transformation properties is the field strength $\mathcal F^{\mu\nu}$. This
implies that the matrix element



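$\int \frac{d\lambda}{2\pi}\, e^{-i\lambda x} \langle p\eta | \mathcal F^{\mu\nu}(\lambda n)\, \mathcal G_n(\lambda n, 0)\, \mathcal F^{\alpha\beta}(0) | p\eta \rangle$
$\quad = -x f_{g/g}(x) \left( g^{\mu\alpha} p^\nu p^\beta - g^{\mu\beta} p^\nu p^\alpha + g^{\nu\beta} p^\mu p^\alpha - g^{\nu\alpha} p^\mu p^\beta \right)$ (2.92)
$\qquad + i\eta\, x \tilde f_{g/g}(x) \left[ \epsilon^{\mu\alpha\gamma\delta} p^\nu p^\beta - \epsilon^{\mu\beta\gamma\delta} p^\nu p^\alpha + \epsilon^{\nu\beta\gamma\delta} p^\mu p^\alpha - \epsilon^{\nu\alpha\gamma\delta} p^\mu p^\beta \right] n_\gamma p_\delta$ ,

contains the vacuum as an intermediate state. $\mathcal G_n$ is the same gauge link as that in the quark
distribution operator, with the generators $T^a$ appearing in the gluon fields interpreted in the
adjoint representation.^60 The normalization has been chosen so that the scalar distributions,^61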

f g/a (x) = J ^e- iXx {pq\^ a (Xn)G n (\n,0)T\ (0) \vn)n p n v , (2.93) 

f ~ 9/a{x) = ~2^J ^^(P^ ( Xn )^n (An,0)^ Q (0) \pnWnu , (2.94) 
which can be isolated by contraction, have the leading behavior 
f g/g (x) = ^[S(l-x)+S(l + x)} + 0(a s ) 

fg/g(x) = | [S(l -x)- 5(1 + x)] + <D(a s ) . (2.95) 

As before, these distributions can be calculated in perturbation theory by introducing
new fields with eikonal propagation and inserting a complete set of states.^62 The physical
polarization vectors for gluons moving in the 3-direction are given by $\varepsilon^\mu_\eta = (0, 1, i\eta, 0)/\sqrt{2}$.

60 See Appendix These indices have been suppressed as in the quark distribution operators. 

61 The dual field strength is defined by $\tilde{\mathcal F}^{\mu\nu} = \frac{1}{2} \epsilon^{\mu\nu\alpha\beta} \mathcal F_{\alpha\beta}$.

62 Since gluons are bosons, the creation and annihilation operators in each gluon field are of the same species. 

Since this distribution appears only with $t_g^{\mu\nu}$, a quantity that starts with $\alpha_s$, we do not need
the next-to-leading behavior in our calculation. This also implies that we need not calculate
the 'gluon distribution in a quark', $f_{g/q}$, since this object can only contribute to $T_q^{\mu\nu}$ at order
$\alpha_s^2$.

To see the connection between the above distributions and the idea of removing one gluon,
we turn again to the singular light-cone gauge. As before, the gauge link is trivial in this
gauge. Furthermore, this choice reduces $n_\alpha \mathcal F^{\alpha\mu}$ to $n \cdot \partial A^\mu$. Introducing a complete set of
intermediate states, we see that $f_{g/g}(x)$ becomes^63

f g/a (x) = -i fp e -^J2^\in -dAZ {\n)\X){X\in- cU£ (0)| ph) 

:J25(p x -n-(l-x)) {ph \A«(0)\X) (X \A^)\ph) (2.96) 



x 

X 



Aside from the factor of $x$, this is exactly what we would have written down for the unpolarized
gluon distribution if we did not have to worry about gauge invariance.^64 However, a proper
normalization must take the issue of gauge invariance into account. If we wish to normalize
all of our distributions in an unambiguous way, it is necessary to relate them to a physical
observable. The conventional choice is the total + momentum carried by a certain parton
in our target. Interpreting the parton distributions as probabilities, we would conclude that
this object is simply a sum over the probabilities weighted by the momentum carried by each
parton:

$n \cdot P_{a/T} = \int_0^1 dx\, x f_{a/T}(x)$ . (2.97)

This leads to the $\delta(1 + x)$ terms above, which do not appear in the fermion case because the action of antiquark
creation and annihilation operators cannot generate connected diagrams at this order. Note that this Bose
symmetry can be taken into account simply by requiring (anti)symmetry in the argument $x$ for the (un)polarized
distribution. This is a general property of the matrix element (2.92).

63 neglecting the Bose symmetry

64 The factor $-g_{\alpha\beta}/2$ averages over transverse gluon polarizations with the weight +1; remember that in
axial gauge $A \cdot A = -A_\perp \cdot A_\perp$.

On the other hand, the operator representing the momentum carried by various parton species
can be obtained from the stress-energy tensor of QCD. According to the analysis of Appendix
D.2, we have



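$P^\mu = \int d^3x\, \Theta^{0\mu}(x)$ (2.98)

$\Theta^{\mu\nu} = \Theta_q^{\mu\nu} + \Theta_g^{\mu\nu}$ (2.99)

$\Theta_q^{\mu\nu} = \bar\psi\, i\gamma^{(\mu} \overleftrightarrow{D}^{\nu)} \psi$ (2.100)

$\Theta_g^{\mu\nu} = \frac{1}{4} g^{\mu\nu} \mathcal F^2 - \mathcal F^{\mu\alpha} \mathcal F^{\nu}{}_{\alpha}$ . (2.101)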

Lorentz invariance implies

$\langle P | \Theta_{q,g}^{\mu\nu} | P \rangle = 2 a_{q,g} P^\mu P^\nu + 2 b_{q,g} g^{\mu\nu} M^2$ . (2.102)

Since $|P\rangle$ is an eigenstate of $P^\mu$, we have^65

$P^\mu (2P^0) (2\pi)^3 \delta^{(3)}(0) = \langle P | \int d^3x\, \Theta^{0\mu}(x) | P \rangle$
$\qquad = \langle P | \Theta^{0\mu}(0) | P \rangle (2\pi)^3 \delta^{(3)}(0)$ . (2.103)

This immediately tells us that

$a_q + a_g = 1$ , $b_q + b_g = 0$ , (2.104)

which promotes the identification of $a_{q,g}$ with the fraction of momentum carried by quarks
and gluons, respectively.^66 These parameters are pure numbers^67 and may be obtained in any
way we wish. In particular, from Equation (2.102), we see that



Vie++|P) . (2.105) 



Our normalization condition, Eq.(2.97), then takes the convenient form 



\ (P\ \ p ) = I dxxf w/T {x) . (2.106) 



65 Our states are normalized so that $\langle P' | P \rangle = 2P^0 (2\pi)^3 \delta^{(3)}(P' - P)$.

66 The quantity $b_{q,g}$ should not bother us too much here. Its presence can be understood in the rest frame
of the nucleon, where Equation (2.102) would otherwise imply that no species of parton can have nontrivial
momentum dynamics. These kinds of transverse effects are suppressed by powers of $Q^2$ in the Bjorken limit;
in any case, the contribution of $b$ to the + momentum of the proton for any appreciable value of the boost
parameter A is quite negligible.

67 ignoring, for the moment, their dependence on renormalization scale

It is now easy to see why the 'extra' factor of $x$ adorning the gluon distributions is needed.^68
Although the correctly normalized gauge-invariant distributions are given by $f_{g/T}$ and
$\tilde f_{g/T}$, our perturbative calculation in the last section was performed with single gluon external
states. For these states, distributions of the form in Eq.(2.96) are required. However, it is the
proper distributions which will survive the transition from parton external states to physical
hadronic ones. We can account for the difference between these two distributions simply by
dividing our perturbative results by $x$ since



dX 
2^ 



^-e- tXx (PS \A^(Xn)Al(0)\PS) 



2 1 i 



d- 2 x 



-9±fg/T(x) + 



fg/T(x) 



(2.108) 



in the light-cone gauge. 

Armed with perturbative expressions for our parton densities, we are now ready to extract 
the hard scattering coefficients. Exploiting the fact that we work only to next-to-leading order, 
we write 



4tt 



4ttMo 



-6/2 



'iR 



(rr) (D 



S C F 2 fn 2 e~< E \ 



-e/2 



W) - Cw + ^;(iS) 



4tt {t * > 



(2.109) 
(2.110) 



for the quark amplitude. The infrared divergent term in $T^{\mu\nu}$ has been designed to include
only the infrared divergence as seen in the MS scheme. For example, the term



2 (Q z e 



in T^ 1 " contributes 



O-sCj 

' 47r x B - 1 e \ 47r A*o 



XB 



1 



(2.111) 



(2.112) 



68 A somewhat easier, but less rigorous way to understand this factor comes from a different analogy with
the quark case. Above, we have interpreted $\bar\psi$ as the hermitian conjugate to $\psi$. If instead we think of it as the
momentum canonically conjugate to $\psi$, then the analogy with the gluon sector changes its nature completely.
The canonically conjugate momentum to the gluon field is $\mathcal F$, so we would naively write

(2.107)

for the (unpolarized) gluon distribution. In order to make this gauge-invariant, we must eliminate $A$ in favor
of $\mathcal F$. The 'extra' factor of $x$ compensates for the necessary derivative.






to (T^)£ and 



to {T^ v ^ . Note that we are considering scattering on a quark of momentum p rather than 
xp here. The perturbative coefficient functions are designed not to care. 



9^ e\ 



3 . Q 2 
T lo S — 



(2.113) 



Equating terms order by order in a s , we see that Equation (2.52) implies 



(T)2=E/_ l */f),(')(*?') (W w 



(2.114) 
(2.115) 
(2.116) 



Equations (2.114) and (2.115) are used to define $(t_q^{\mu\nu})^{(0)}$ and $(t_q^{\mu\nu})^{(1)}$. Equation (2.116) is a
statement about the sources of soft physics in our calculation. If it is satisfied, all of the soft
physics relevant to DIS is contained within the parton distribution functions. Otherwise, other
sources of soft physics contribute to this process. This situation could imply that one cannot
separate the hard and soft scales. Fortunately, this is not the case in inclusive DIS. Explicit
substitution of the expressions above into (2.114), (2.115), and (2.116), and analogous relations
in the gluon sector, shows that (2.116) is satisfied for polarized and unpolarized DIS on quark
and gluon targets at order $\alpha_s$. The results can be expressed for any target $T$ in the simple form



^2 fq/T{%, fJ?)C q 



Si(x B ) 
whereFl 



1 dx 

-l x 



x y 
x B ' A« 



fg/T(x^ 2 )C g 



x Q 2 



xb M 



(2.117) 



' xb — x 



a s (n 2 )C F 2 
4tt &q 



„ , Q 2 

3 - log -j 
xb — x



21og — 
i 1 



69 The leading antiquark scattering amplitude is just the negative of the leading quark contribution evaluated 
at — x, so these contributions can be combined into the same hard scattering coefficient. 

70 Note the extra factors of 1/x in the gluonic coefficient functions coming from the difference in definition 
between the properly normalized gluon distributions and the distributions relevant to our calculated amplitudes. 
(jl 9.

\xb ' M : 



.r 

x B /U 



C a — ,^ 




(2.118) 



(2.119) 



(2.120) 



(2.121) 



Renormalization has introduced violations of the simple scaling behavior we saw in the
leading order amplitudes. The dependence on renormalization scale is illusory; the coefficient
functions and parton distributions depend separately on $\mu$ in such a way that the product
is independent. However, the dependence of our amplitude on $Q^2$ is quite real and can be
measured in experiment. In the absence of asymptotic freedom, this behavior would persist
even into the Bjorken region so scaling could never be experimentally observed. Asymptotic
freedom ensures that scaling violations become smaller as $Q^2$ is increased.

Since these scattering coefficients are completely insensitive to the soft infrared physics,
(2.117) is valid for all targets. In particular, we can derive expressions for the quantities
measured in real DIS on proton targets simply by taking the imaginary part of $T_1$ and $S_1$ as
$x_B$ approaches the real axis from below.^71 The parton distribution functions are real, so the
measured structure functions are expressed as convolutions of the unknown structure functions
with known scattering kernels. At leading order, we have simply^72

$F_1(\nu, Q^2) = \frac{1}{2} \sum_q e_q^2 \left[ f_{q/T}(x_B, Q^2) + f_{\bar q/T}(x_B, Q^2) \right]$ (2.122)

$G_1(\nu, Q^2) = \frac{1}{2} \sum_q e_q^2 \left[ \tilde f_{q/T}(x_B, Q^2) - \tilde f_{\bar q/T}(x_B, Q^2) \right]$ . (2.123)

The interpretation of this result is clear. At leading order, the photon simply couples to
each parton completely incoherently. $F_1$ measures the total number of charge carriers with
momentum $x_B P$ weighted by the square of the charge. Notice that the sum of the quark
and antiquark distributions is relevant here, rather than the difference, since both contribute
to DIS in the same way. Likewise, $G_1$ measures the helicity of charged carriers weighted by
squared charge. Since the antiquark distribution has been defined in such a way that $\tilde f_{\bar q/T}$
measures the net negative helicity of antiquarks, it appears with a sign in this expression. The
kinematical restriction $x_B = x$ corresponds to the requirement that the intermediate quark be
'physical'.

A measurement of the structure functions allows us to extract the parton distributions and
obtain invaluable information on nonperturbative hadronic structure. In particular, integrals
of the structure functions can tell us how much of a proton's momentum is carried by quarks
or the total spin carried by quark helicity in tritium. The results of this section allow us to go
even further and predict the leading QCD corrections to $F_1$ and $G_1$. All that is needed is the
imaginary parts of the coefficient functions, Eqs. (2.118), (2.119), (2.120), and (2.121).^73

71 At this point, we must analytically continue our expressions to the region $|x_B| < 1$. It is only in this
region that the correspondence between our amplitude and physical measurements can be made.

72 We choose $\mu^2 = Q^2$ because this is the most natural value for it. The full expression is indeed independent,
if we calculate to all orders. Our lack of motivation for this task forces us to choose $\mu^2$ in such a way that
corrections are minimized.

73 Taking the imaginary part of the logarithms in these expressions is somewhat tricky. For $x < x_B$,

$\mathrm{Re} \log \frac{x_B - x}{x_B} = \log \frac{x_B - x}{x_B}$ , $\mathrm{Im} \log \frac{x_B - x}{x_B} = 0$ , (2.124)

while for $x > x_B$,

$\mathrm{Re} \log \frac{x_B - x}{x_B} = \log \frac{x - x_B}{x_B}$ , $\mathrm{Im} \log \frac{x_B - x}{x_B} = -\pi$ . (2.125)
xb 

Once the parton distributions are known at a certain scale, their scale evolution is fixed by
perturbative QCD. This makes the study of scaling violations an especially clean way to view
QCD. In fact, the prediction of these violations in DIS is one of the most precise tests of the
theory.

The scale evolution of our distribution functions is interesting in its own right. Physically,
it arises for the same reason that $\alpha_s(\mu^2)$ runs: things look different at different scales. From
very far away, I may think that I am looking at a quark of momentum $p$. However, if I look
closer by increasing my probing scale, I may find that that quark is really a quark of momentum
$xp$ and a gluon of momentum $(1 - x)p$. This line of reasoning leads to the evolution equation

$\mu^2 \frac{D_a}{D\mu^2} f_{a/T}(x, \mu^2) = \frac{\alpha_s(\mu^2)}{2\pi} \sum_{a'} \int_0^1 dy\, f_{a'/T}(y, \mu^2) \int_0^1 dz\, P_{a' \to a}(z)\, \delta(x - zy)$ . (2.126)

Here, $\alpha_s P_{a' \to a}(x)\, d\mu^2 / 2\pi\mu^2$ represents the probability that a parton of type $a'$ and momentum
$p$ will generate a parton of type $a$ and momentum $xp$ due to the infinitesimal change $d\mu^2$ in
scale. This activity will add to the distribution $f_{a/T}(x)$. The 'covariant' derivative, $D$, is
designed to include the change in $f_{a/T}(x)$ due to partons of type $a$ splitting into other partons.
This is known as the endpoint contribution. Conceptually, it would seem that this term should
just be some kind of sum over probabilities $P_{a \to a'}$. However, classical reasoning does not work
here. In reality, we do not have conservation of particles in a quantum field theory. The
only conserved quantities here are those quantum numbers associated with some symmetry of
our lagrangian. In this case, the endpoint contributions are fixed by flavor and momentum
conservation. As mentioned above,^74 conservation of flavor implies that

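$\int_0^1 dx \left[ f_{q/T}(x, \mu^2) - f_{\bar q/T}(x, \mu^2) \right]$ (2.128)

is independent of $\mu^2$. Using

$\mu^2 \frac{D_a}{D\mu^2} \equiv \mu^2 \frac{d}{d\mu^2} + \frac{\alpha_s(\mu^2)}{2\pi} A_a$ , (2.129)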



$-\pi$ appears rather than $\pi$ because for $x_B$ slightly below the real axis, $(x_B - x)/x_B$ is a negative number with a
negative imaginary part. Taking the branch cut along the negative real axis, we arrive at (2.125).

74 At this level, it is conceptually easier to use the explicit forms of the quark and antiquark distributions.
We can always simplify our expressions later using (2.75).

we see that this requirement leads to^75

$A_q = \int_0^x \frac{dy}{x} \left[ P_{q \to q}(y/x) - P_{q \to \bar q}(y/x) \right]$ . (2.130)

Some of the relations





= Pq^q'{x) 


(2.131) 




= Pq-^q'{x) 


(2.132) 




= Pq^gix) 


(2.133) 




= Pq^q(x) 


(2.134) 


Sqq')Pq^q'(x) 


= (l-5 qq ,)P q ^,(x) 


(2.135) 




= (l-5 qq ,)P q ^ q (x) 


(2.136) 


Pq^ 3 (x) 


= P q >^ 9 (x) 


(2.137) 


Pg^q(x) 


= Pg^q'{x) 


(2.138) 


Pq^qW 


= Pq'^Ax) 


(2.139) 



are necessary to derive this result. All follow from the CP and flavor invariance of the QCD
lagrangian and are valid to all orders in perturbation theory.^76 At the order to which we work,
$P_{q \to \bar q}$ is zero. Hence conservation of flavor requires

$A_q = \int_0^x \frac{dy}{x} P_{q \to q}(y/x)$ . (2.140)



On the other hand, we have calculated $f_{q/q}(x, \mu^2)$ in perturbation theory. Manually taking
the derivative of Eq.(2.62), we arrive at
d 



M d^ fq/q( * X,fi ) 



g s (fi 2 )C F 
2tt 

2tt 



1 + x z 
1-x 



- 6(1 -x) dy 



i + y 2 



dydz Cf 



1 + z 2 

1 - z 



f q/q (y,H 2 )S(x-zy) (2.141) 



a s (n 2 ) [ x dy x 2 + y 2 2 
■ C F f q / q {x,V ) 



2vr 



x-y 



75 In these kinds of manipulations, it is often easier to use the primordial form of the evolution equation,
(2.126), rather than the streamlined (2.127). Note that we must be very careful deriving this result since
the integrals involved are singular. In particular, the form of $P_{q \to q}(z)$ at leading order in $\alpha_s(\mu^2)$ leads to an
anomalous dependence of $A_q$ on $x$.

76 Quark mass effects break flavor invariance. As such, they will alter relations (2.136), (2.137), (2.138), and
(2.139). However, this will not change (2.130).

at leading order in $\alpha_s(\mu^2)$. Comparing with (2.126), we see that

$P_{q \to q}(x) = C_F \frac{1 + x^2}{1 - x}$ , (2.142)

$A_q = C_F \int_0^x \frac{dy}{x^2} \frac{x^2 + y^2}{x - y}$ (2.143)

at leading order in $\alpha_s$. In light of (2.130), these two quantities are not independent. Since
$P_{q \to q}(x)$ comes entirely from gluon intermediate states, the calculation of vacuum intermediate
states is in some sense redundant. In the light-cone gauge, the absence of the gauge link causes
the diagrammatic calculation to be incomplete. In order to obtain the full result in this gauge,
one must impose (2.130).^77

We are now in a position to understand the divergent integral in $A_q$. Its presence is required
to cancel the divergent behavior of $\int P_{q \to q}$. To display this explicitly, one often combines the
two terms:

$\mu^2 \frac{d}{d\mu^2} f_{a/T}(x, \mu^2) = \sum_{a'} \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \left[ P_{a' \to a}(x/y) - A_a \delta_{aa'} \delta(1 - x/y) \right] f_{a'/T}(y, \mu^2)$
$\qquad = \sum_{a'} \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \mathcal P_{a' \to a}(x/y)\, f_{a'/T}(y, \mu^2)$ . (2.144)



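In the quark case, we can write explicitly

$\mathcal P_{q \to q}(x) = C_F \left[ \frac{1 + x^2}{(1 - x)_+} + \frac{3}{2} \delta(1 - x) \right]$ . (2.145)

The '+ function', $1/(1 - x)_+$, is defined by

$\frac{1}{(1 - x)_+} = \lim_{\epsilon \to 0} \left[ \frac{1}{1 - x} \Theta(1 - x - \epsilon) - \delta(1 - x) \int_0^{1 - \epsilon} \frac{dy}{1 - y} \right]$ . (2.146)

Its definition implies

$\int_0^1 dx\, \frac{f(x)}{(1 - x)_+} = \int_0^1 dx\, \frac{f(x) - f(1)}{1 - x}$ , (2.147)

which is more useful in practice.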
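
As a quick numerical illustration of (2.147), the sketch below evaluates moments of the regulated quark kernel with the plus-prescription. It assumes the leading-order form $\mathcal P_{q \to q}(x) = C_F [ (1 + x^2)/(1 - x)_+ + \frac{3}{2} \delta(1 - x) ]$ with $C_F = 4/3$; the function names and the midpoint rule are illustrative choices and not part of the original calculation. The first moment should vanish, which is the statement of flavor conservation, and the second should come out close to $-\frac{4}{3} C_F$.

```python
# Moments of the regulated q -> q kernel via the plus-prescription (2.147).
# Assumption: leading-order kernel CF*[(1+x^2)/(1-x)_+ + (3/2) delta(1-x)].

CF = 4.0 / 3.0  # quark Casimir of SU(3)


def plus_integral(f, n=200_000):
    """Midpoint estimate of int_0^1 dx f(x)/(1-x)_+ = int_0^1 dx [f(x)-f(1)]/(1-x)."""
    h = 1.0 / n
    f1 = f(1.0)
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h  # midpoints never touch the endpoint x = 1
        total += (f(x) - f1) / (1.0 - x)
    return total * h


def pqq_moment(N):
    """int_0^1 dx x^(N-1) P_qq(x) for the assumed leading-order kernel."""
    smooth = plus_integral(lambda x: x ** (N - 1) * (1.0 + x * x))
    return CF * (smooth + 1.5)  # the delta function adds 3/2 for every N


first_moment = pqq_moment(1)   # flavor conservation: expected to vanish
second_moment = pqq_moment(2)  # momentum lost by the quark to radiated gluons
print(first_moment, second_moment)
```

The subtraction $f(x) - f(1)$ is what makes the integrand finite at $x = 1$, so no explicit cutoff $\epsilon$ is needed in the numerical version.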

77 This is true for a well-defined, consistent light-cone gauge choice. Due to the similarity between the pole 
prescription indicated in Fig. 2.4 and the PV-prescription, the pathological theory reproduces the correct 
end-point contribution at this order while ML does not. However, this does not reconcile our renormalizability 
issues with PV. 

The other conservation law which must be satisfied by these distributions is that of
momentum. We have normalized our distributions in such a way that

$\sum_a \int_0^1 dx\, x f_{a/T}(x, \mu^2) = n \cdot P_T$ (2.148)

is the total + momentum of the target. Since this cannot be scale dependent, we have the
relation

$\sum_a \int_0^1 x\, dx\, A_a f_{a/T}(x, \mu^2) = \sum_{a, a'} \int_0^1 z\, dz\, P_{a' \to a}(z) \int_0^1 y\, dy\, f_{a'/T}(y, \mu^2)$ . (2.149)

Using the above relations once more, along with the fact that $A_q = A_{\bar q}$ is independent of flavor
and of target, we arrive at

V dy 



A, 



x x 



The first of these relations allows us to deduce $A_g$ without the use of vacuum insertions (or
provides a useful check of our calculation), while the second takes the form of an unexpectedly
strict consistency condition on the allowed probabilities for quark splitting. This equation can
be used as an extremely strong check of one's expressions.
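
The consistency condition on quark splitting can be made concrete at leading order. The worked check below is a sketch under stated assumptions: it takes the standard leading-order kernel $P_{q \to g}(z) = C_F [1 + (1 - z)^2]/z$, which is not derived in this section, together with $P_{q \to q}(z)$ from (2.142) and the flavor condition $A_q = \int_0^1 dz\, P_{q \to q}(z)$, which is (2.140) after the substitution $z = y/x$.

```latex
% Momentum conservation for a quark that splits only into q + g at leading order:
%   A_q = \int_0^1 dz\, z\,[P_{q\to q}(z) + P_{q\to g}(z)],
% while flavor conservation gives A_q = \int_0^1 dz\, P_{q\to q}(z).
% Subtracting the two removes the divergent endpoint at z = 1:
\begin{align}
0 &= \int_0^1 dz\,\bigl[z\,P_{q\to g}(z) - (1-z)\,P_{q\to q}(z)\bigr] \\
  &= C_F\int_0^1 dz\,\bigl[1+(1-z)^2\bigr] - C_F\int_0^1 dz\,\bigl[1+z^2\bigr] \\
  &= C_F\left(\tfrac{4}{3}-\tfrac{4}{3}\right) = 0 .
\end{align}
```

In this form the condition is satisfied because the gluon radiated with momentum fraction $z$ is the partner of a quark left with fraction $1 - z$, so that $P_{q \to g}(z) = P_{q \to q}(1 - z)$ away from the endpoints.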

Our perturbative calculations of the parton distributions allow us to extract $P_{g \to q}(x)$
explicitly by differentiating $f_{q/g}(x)$. The result,

$P_{g \to q}(x) = T_F \left[ x^2 + (1 - x)^2 \right]$ , (2.152)

allows us to write the full evolution equation

$\mu^2 \frac{D_q}{D\mu^2} f_{q/T}(x, \mu^2) = \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \left[ P_{q \to q}(x/y) f_{q/T}(y, \mu^2) + P_{g \to q}(x/y) f_{g/T}(y, \mu^2) \right]$ . (2.153)

This equation is complicated by the presence of the gluon distribution.^78 As before, we can
form nonsinglet distributions which decouple from the gluon sector using the diagonal generators
of $SU(n_f)$. Due to the symmetry of our theory these nonsinglet functions all satisfy the
same evolution equation

$\mu^2 \frac{D_q}{D\mu^2} f_{NS/T}(x, \mu^2) = \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \left[ P_{q \to q}(x/y) - P_{q \to \bar q}(x/y) \right] f_{NS/T}(y, \mu^2)$ (2.155)

to all orders in perturbation theory. Its appearance in this evolution equation as well as its
role in the determination of $A_q$ make

$P_{NS}(x) = P_{q \to q}(x) - P_{q \to \bar q}(x)$ (2.156)

deserving of the title 'nonsinglet splitting function'.

78 Note that the gluon distribution here has the leading behavior

$f_{g/g}(x) \sim x\, \delta(1 - x)$ (2.154)

rather than Eq. (2.95) because we have chosen to eliminate negative arguments. In the quark case, this means
that we must consider both a quark and an antiquark distribution. Since gluons are their own antiparticles,
their distributions add.



Equation (2.155) is also satisfied by the nonsinglet antiquark distributions and the difference
between the singlet quark and antiquark distributions. Only the lone full singlet distribution,

$f_{S/T}(x, \mu^2) = \sum_q \left( f_{q/T}(x, \mu^2) + f_{\bar q/T}(x, \mu^2) \right)$ , (2.157)

can mix with the gluon fields. Their evolution is coupled according to
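
$\mu^2 \frac{D_q}{D\mu^2} f_{S/T}(x, \mu^2) = \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \left[ P_S(x/y) f_{S/T}(y, \mu^2) + 2 n_f P_{g \to q}(x/y) f_{g/T}(y, \mu^2) \right]$ (2.158)

$\mu^2 \frac{D_g}{D\mu^2} f_{g/T}(x, \mu^2) = \frac{\alpha_s(\mu^2)}{2\pi} \int_x^1 \frac{dy}{y} \left[ P_{q \to g}(x/y) f_{S/T}(y, \mu^2) + P_{g \to g}(x/y) f_{g/T}(y, \mu^2) \right]$ , (2.159)

where I have defined

$P_S(x) \equiv 2 n_f P_{q \to \bar q}(x) + P_{NS}(x)$ . (2.160)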



The coupling of the gluon and singlet quark distributions makes data on these objects extremely 
difficult to analyze. Since one needs to know both distributions entirely at one scale in order 
to evolve either to another, measurements at different scales are not directly related to one 
another. 

In the polarized sector, everything works in exactly the same way. The nonsinglet distributions
decouple from the gluons and each other to evolve autonomously, while the fully singlet
distribution mixes with the polarized gluon distribution. Here, the definition of the antiquark
distribution implies that it is the difference of the quark and antiquark singlet distributions
that forms the fully singlet distribution, rather than the sum. The endpoint contributions,
being properties of the fields themselves, are identical to those in the unpolarized case. Only
the evolution kernels are different. In Appendix A.4, the full set of kernels at leading order in
QCD is displayed. At next-to-leading order, one can find them in [43]. Note that Equation
(2.151) is satisfied by these functions.

The expressions (2.144) are called Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (DGLAP)
evolution equations in honor of the physicists who first derived them [54]. The functions
$\mathcal P_{a' \to a}(x)$, known either as splitting functions or DGLAP evolution kernels, allow us to sum
certain effects to all orders in perturbation theory. For example, although the probability of
finding an antiquark in a quark is zero at $\mathcal O(\alpha_s)$, the nonzero splitting functions $\mathcal P_{q \to g}$ and
$\mathcal P_{g \to \bar q}$ will generate a nonzero probability via the DGLAP equation as the scale is varied. The
distribution generated in this way will not be the same as that generated by using the correct
nonzero splitting function $\mathcal P_{q \to \bar q}$, but it will certainly be closer than the direct leading order
prediction of zero. This effect comes from our assumption (2.126) of the form the evolution will
take. Once we assert this form, which is motivated on physical grounds, our equation already
knows part of the result for the next order. The use of these kinds of evolution equations to
sum certain contributions to all orders has been developed into a high art form and can be
extremely useful in analyzing data.
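
To illustrate how an evolution equation sums effects to all orders, the sketch below evolves the second moment of a nonsinglet distribution. Taking moments turns the convolution in the nonsinglet equation into a product, so the moment obeys $dM/d\log\mu^2 = (\alpha_s/2\pi)\, \gamma_2\, M$ with $\gamma_2 = \int_0^1 dx\, x\, \mathcal P_{q \to q}(x) = -\frac{4}{3} C_F$ at leading order. The one-loop running coupling, the choice $n_f = 4$, and the value of $\Lambda^2$ are assumptions made only for this example, as are all function names.

```python
import math

CF = 4.0 / 3.0
GAMMA_2 = -4.0 / 3.0 * CF  # int_0^1 dx x P_qq(x) at leading order


def alpha_s(mu2, nf=4, lam2=0.04):
    """One-loop running coupling; nf and lam2 (in GeV^2) are illustrative choices."""
    beta0 = 11.0 - 2.0 * nf / 3.0
    return 4.0 * math.pi / (beta0 * math.log(mu2 / lam2))


def evolve_moment(m0, mu2_start, mu2_end, nf=4, steps=20_000):
    """Integrate dM/dlog(mu^2) = alpha_s/(2 pi) * GAMMA_2 * M in small steps."""
    t0, t1 = math.log(mu2_start), math.log(mu2_end)
    dt = (t1 - t0) / steps
    m = m0
    for i in range(steps):
        t_mid = t0 + (i + 0.5) * dt
        rate = alpha_s(math.exp(t_mid), nf) / (2.0 * math.pi) * GAMMA_2
        m *= math.exp(rate * dt)  # exact update for a rate held fixed over dt
    return m


def closed_form(m0, mu2_start, mu2_end, nf=4):
    """Resummed solution M ~ alpha_s^(-2*GAMMA_2/beta0) of the same equation."""
    beta0 = 11.0 - 2.0 * nf / 3.0
    ratio = alpha_s(mu2_end, nf) / alpha_s(mu2_start, nf)
    return m0 * ratio ** (-2.0 * GAMMA_2 / beta0)


numeric = evolve_moment(1.0, 4.0, 100.0)
analytic = closed_form(1.0, 4.0, 100.0)
print(numeric, analytic)
```

The closed form contains every power of $\alpha_s \log(\mu^2)$, even though only the leading-order kernel was used as input. The moment decreases as the scale grows, reflecting momentum transferred from quarks to radiated gluons.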



2.4 Factorization to All Orders in DIS 

Now that we have seen that the infrared divergences present in our one-loop amplitudes can be 
attributed to parton distributions in a parton, it is natural to ask if this property persists to all 
orders in perturbation theory. In this section, I will outline a general proof of factorization for 
inclusive DIS. This proof has been undertaken in a rigorous field-theoretic manner by several 
people [[ItJ ^8|, [55| . A very instructive recent application of the method under consideration 
here can be found in Ref . [Q . 

Our proof is structured as follows. First, we must identify the infrared-sensitive regions of
our process. These regions can be characterized by certain reduced diagrams in which only the
infrared behavior is emphasized. Since we work only at leading order in the Bjorken limit,^79
we will concern ourselves exclusively with regions that contribute to the leading behavior
of the amplitude. These regions have special properties which allow us to show that their
contribution does nothing more than generate the distribution functions introduced in the last
section. Since all of the infrared-sensitive regions of our process can be segregated into these
nonperturbative distributions, the remainder of the amplitude is due entirely to high energy
physics and may be reliably calculated in perturbation theory.

Figure 2.6: A representation of the all-orders scattering process. The nucleon 'blob' is intended
to represent an all-orders sum, and the external photons are connected in all possible ways to
this blob.

To begin, we must find a way to isolate the regions of our process which are sensitive to
soft scales. The easiest and most effective way to do this is to simply set all of the soft scales
to zero. Since the perturbatively calculable hard scattering regions are insensitive to these
scales, they will be unaffected by this procedure. On the other hand, the infrared regions of
interest here will be forced to generate divergences. It must be emphasized that we are not
assuming that the soft scales in our problem are zero. We are setting them to zero only as a
means to identify the soft physics of our process.

79 At higher orders, the arguments of Section may no longer apply. Since these corrections are relevant
for smaller values of $Q^2$, they will involve processes in which the partons do interact with each other during the
scattering. Depending on the degree of this mixing of scales, we may or may not have a factorization theorem.
Some of the necessary considerations for these power corrections are discussed in Chapter ^. For more detailed
studies, especially with reference to factorization theorems, see fej].

To this end, we consider a general Feynman diagram in massless QCD. Since we work in
principle to all orders in perturbation theory, we will consider an external nucleon state which
we draw as a blob (shown in Figure 2.6). The propagator structure of an arbitrary diagram
may be combined into one denominator using a technique of Feynman's:^80

$\sum_i \alpha_i k_i^2(\ell) + i\epsilon$ . (2.161)

Here, the $\alpha_i$ are Feynman parameters and the $k_i$ are propagator momenta (which are
related linearly to the loop momenta $\ell_j$). Infrared divergences come from regions of the joint
parameter-momentum space where (2.161) vanishes. Not all of these regions can give an
infrared divergence. Consider the integral

$\int_0^3 \frac{dx}{x - 1} = \log 2$ . (2.162)

The integrand surely diverges at $x = 1$, but the integral itself is finite.^81 This is because the
contour of integration can be deformed away from the singularity free of charge. These kinds
of integrals can lead to branch cuts in our amplitudes and contribute to the imaginary part,
but they can never give infrared divergences.
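
The claim that such an integral stays finite, and only acquires an imaginary part, can be checked numerically. The sketch below integrates $1/(x - 1 + i\epsilon)$ from 0 to 3 at small but finite $\epsilon$ with a midpoint rule; the step count and the value of $\epsilon$ are arbitrary illustrative choices. The real part should approach $\log 2$ and the imaginary part $-\pi$ as $\epsilon$ is reduced.

```python
import math


def integral_with_ieps(eps, n=600_000):
    """Midpoint estimate of int_0^3 dx / (x - 1 + i*eps)."""
    h = 3.0 / n  # step must be much smaller than eps to resolve the peak at x = 1
    total = 0j
    for i in range(n):
        x = (i + 0.5) * h
        total += 1.0 / complex(x - 1.0, eps)
    return total * h


value = integral_with_ieps(1e-2)
print(value.real, math.log(2.0))  # real part: principal value
print(value.imag, -math.pi)       # imaginary part: from passing below the pole
```

The residual difference from $-\pi$ in the imaginary part is of order $\epsilon$ and shrinks as $\epsilon$ is lowered, which is the numerical counterpart of deforming the contour away from the singularity.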

There are only two ways an integral can lead to an infrared divergence. The first is if
the integrand is divergent at an endpoint of the integration domain. In this case, we cannot
deform the contour away from the singularity. An example of an endpoint singularity is given
by

$\int_0^1 \frac{dx}{x}$ . (2.165)

This integral is unavoidably infinite. Note that this is the kind of integral that generates the
light-cone singularities in the PV-regulated axial gauge.

The second way to obtain an unavoidably infinite integral is by pinching the contour so
that it cannot be deformed away from the singularity, for example

$\lim_{\epsilon \to 0} \int_{-1}^{1} \frac{dx}{(x - i\epsilon)(x + i\epsilon)} = \lim_{\epsilon \to 0} \frac{2}{\epsilon} \tan^{-1}\left( \frac{1}{\epsilon} \right)$ . (2.166)


80 See Appendix A.3.

81 Mathematically, this integral is ill-defined. However, a prescription such as

$\lim_{\delta \to 0} \left[ \int_0^{1 - \delta} \frac{dx}{x - 1} + \int_{1 + \delta}^{3} \frac{dx}{x - 1} \right] = \log 2$ (2.163)

can be used to define the integral unambiguously. This is what the $+i\epsilon$'s in our propagators do. In this case,
we would obtain an imaginary part from the prescription:

$\int_0^3 \frac{dx}{x - 1 + i\epsilon} = \log 2 - i\pi$ . (2.164)

This integral is also unavoidably infinite. However, note that the expression

$\lim_{\epsilon \to 0} \int_{-1}^{1} \frac{dx}{(x + i\epsilon)^2} = -2$ (2.167)

is not divergent. This singularity is not pinched and therefore the integration contour may be
deformed away from it.^82 These two cases are the only ways a contour can be forced through
a singularity, so these are the only kinds of singularities we need consider in our search for
infrared divergences.
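
The contrast between the pinched and unpinched cases is easy to see numerically. The sketch below evaluates both integrals at a few small values of $\epsilon$ from their antiderivatives; the function names are illustrative. The pinched integral grows like $\pi/\epsilon$, while the double pole on one side of the contour tends smoothly to $-2$.

```python
import math


def pinched(eps):
    """int_{-1}^{1} dx / ((x - i eps)(x + i eps)) = (2/eps) * atan(1/eps)."""
    return 2.0 / eps * math.atan(1.0 / eps)


def unpinched(eps):
    """int_{-1}^{1} dx / (x + i eps)^2 from the antiderivative -1/(x + i eps)."""
    return -1.0 / complex(1.0, eps) + 1.0 / complex(-1.0, eps)


for eps in (1e-1, 1e-2, 1e-3):
    # The first column diverges as eps -> 0; the second settles at -2.
    print(eps, pinched(eps), unpinched(eps))
```

In the pinched case the two poles at $x = \pm i\epsilon$ approach the real axis from opposite sides and trap the contour; in the unpinched case both sit on the same side, so the contour can be moved away before the limit is taken.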



Since (2.161) is linear in $\{\alpha_i\}$, singularities in this space cannot be pinched. We can deform
the parameter contour to avoid all singularities unless either

$\alpha_i = 0$ or $k_i^2 = 0$ (2.169)

for each $i$. In the first case, the singularity occurs at an endpoint so cannot be deformed
away.^83 In the second, (2.161) is independent of $\alpha_i$ so contour deformation produces no effect.
This region of parameter-momentum space will produce an infrared divergence if it is pinched
in momentum space. Since (2.161) is quadratic in $k_i$, there are two poles for each $k_i$. These
poles are pinched only if they occur at the same point on the real line. The two zeros of a
quadratic form $D(x)$ are equal iff $dD(x)/dx = 0$ simultaneously with $D(x) = 0$. The structure of
Feynman diagrams is such that one can always choose loop momenta to appear with the same

82 It may seem a little strange that this integral is not infinite. If we imagine adding up its contributions,
we would certainly get infinity. However, we must not think of our manipulations in this way. What we
are discussing here is the analytic structure of our integrals. The correct way to interpret this result is by
considering instead

\[ \lim_{a\to 0}\int_{-1}^{1}\frac{dx}{(x+a)^2} = \lim_{a\to 0}\frac{2}{a^2-1} = -2 . \tag{2.168} \]

In some sense, it is only a numerical accident that 0 lies between the two endpoints of integration. The true
singularities in this family of integrals lie at $a = \pm 1$, the endpoints. This may seem like a swindle, but in reality
it is the only way to interpret objects like (2.167). These kinds of expressions are ill-defined only in specific
regions of our parameter space. The only sensible thing to do is evaluate them away from these regions and
analytically continue the result later. This is exactly what was done in Section 2.2, where we took $x_B \gg x$ to
simplify the calculation and analytically continued our amplitudes to $x_B < x$ at the end.

83 The other endpoint, $\alpha_i = 1$, requires all other parameters to be zero (since Feynman parameters are
constrained by $\sum_i \alpha_i = 1$). If the denominator (2.161) vanishes in this case, $k_i^2 = 0$. Hence this also reduces
to (2.169).
sign in all of the propagators.^{84} Since propagator momenta always appear quadratically in
the denominator, we can choose this sign to be positive.^{85} With this choice of routing, the
condition for a pinched singularity in $\ell_j$ becomes

\[ \sum_i \alpha_i k_i^\mu = 0 . \tag{2.170} \]

The sum goes over all propagators through which $\ell_j$ flows. In order to meet the criteria for
an infrared divergence, Equation (2.169) must be satisfied by each propagator and Equation
(2.170) must be satisfied by each loop momentum. The conditions are collectively referred to
as the Landau equations [50].

Equation (2.170) has a nice physical interpretation due to Coleman and Norton [58]. Since
$\alpha_i k_i^\mu$ is proportional to the 4-velocity of propagator $i$, we can think of it as the distance
traversed by the particle it represents in a time $\alpha_i k_i^0$.^{86} Equation (2.170) states that as we go around
the loop $\ell_j$, we get back to the point we started from. Since propagators with $\alpha_i \neq 0$ are
onshell ($k_i^2 = 0$), the Landau equations state that pinched surfaces come only from classically
allowed propagation.^{87} Each solution of the Landau equations has a corresponding 'pinched
diagram' in which all lines representing propagators with $\alpha_i = 0$ have been shrunk to a point^{88}
and all other lines represent onshell propagation.

For DIS on a hadron target, the most general pinched diagram for $T^{\mu\nu}$ is shown in Figure
2.7. Here, we see that any number of (collinear) parton lines may connect the hard scattering
to the hadron blob. However, the hard scattering itself must be considered a point interaction.
Diagrams in which this is not the case cannot satisfy (2.169) and (2.170) simultaneously since
two or more onshell propagators leaving the first photon interaction vertex must be moving in
different directions and therefore can never (classically) recombine at the other photon vertex.

84 This can be seen by inspection.

85 All we are saying here is that one can always choose a momentum routing such that $k_i = \sum_j \ell_j + q$, where
$q$ is some external momentum and the sum extends over all loops whose momenta flow through propagator $i$.

86 Since the units of $\alpha_i$ are arbitrary, we can consider it a frame-independent ratio of time to energy $k^0$.

87 We can obtain classically unallowed propagation in quantum mechanics since momentum eigenstates do
not have well-defined spacetime locations. To see this in another way, recall that momentum conservation at
each vertex is imposed after integration over the spacetime location of the vertex.

88 This is because $\alpha_i = 0$ implies no propagation. In this region of parameter space, the integral is independent
of $k_i^\mu$, so it represents a position eigenstate.
Figure 2.7: The most general reduced diagram for DIS. Straight lines represent collinear (or
soft) partons of any species.



Shrinking all offshell propagators to a point thus gives us a pointlike hard interaction. This
implies that summing over all final states in DIS cancels the infrared divergences associated
with final state interactions.^{89} Physically, we can understand this as a consequence of unitarity.
By definition, final state interactions take place on the scale of nonperturbative QCD. For large
$Q^2$, the arguments of Section 2.2 imply that these interactions can only occur long after the
hard scattering has taken place. As such, they cannot affect the probability that the scattering
will take place. By the time final state interactions come into play, that decision has already
been made. The only thing these interactions can affect is the final form of the outgoing
physical state. Therefore, their effects must cancel out in the inclusive cross-section.^{90}

Although all diagrams of the type in Figure 2.7 can contribute to infrared divergences, they 
will not all contribute at leading order in the Bjorken limit. The Bjorken suppression of various 
graphs may be deduced from $Q^2$ power counting. A detailed analysis reveals that only
those diagrams in which the hard scattering is connected to the hadron blob by two 'physical' 

89 Without the sum over final states, we could not obtain the cross-section for DIS from $T^{\mu\nu}$.

90 As $Q^2$ is lowered, the time the hard interaction takes grows until there is no longer a clear separation
between the (not-so-)hard interaction and final (and initial) state interactions. At this point, the above analysis
breaks down. These nonperturbative interactions certainly can and will affect the total inclusive cross-section.
In our classification of reduced diagrams, the assumption of large $Q^2$ is reflected in the fact that we take
point-like couplings for the photon interactions. Less virtual photons have spatially extended couplings which
allow classical propagation between them, reinstating final state interactions.
parton propagators and any number of collinear scalar^{91} gluons give leading contributions
in the Bjorken limit. The appearance of scalar gluons can be understood by considering the
simplest case of one gluon interacting directly with the intermediate quark propagation.^{92} The
gluon insertion breaks the quark propagator into two, giving a suppression of $Q^2$ unless this
can be canceled by the numerator factors. Since large terms in the numerator appear only
with $\not{p}$, leading behavior occurs only when the interaction involves $\not{p}$. In light-cone gauge,
this is not possible since $\not{p}$ is related to $A^+ = 0$. In covariant gauge, we can obtain $\not{p}$ by
contracting the other side of the propagator with $p^\mu$:

\[ p^\mu\left(-g_{\mu\nu} + (1-\xi)\frac{p_\mu p_\nu}{p^2}\right) = -\xi\, p_\nu . \tag{2.171} \]

This procedure gives zero for strictly collinear gluons in light-cone gauge,^{93}

\[ p^\mu\left(-g_{\mu\nu} + p_\mu n_\nu + p_\nu n_\mu\right) = 0 , \tag{2.172} \]

so we can say in either case that leading behavior comes only from scalar gluons. Generalizing
this analysis to higher loops and other couplings leads to the above result.
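
The two contractions can be verified with explicit four-vectors. This is a sketch under stated assumptions: the covariant-gauge numerator is taken to be $-g_{\mu\nu} + (1-\xi)k_\mu k_\nu/k^2$ and the light-cone-gauge numerator $-g_{\mu\nu} + (k_\mu n_\nu + n_\mu k_\nu)/(n\cdot k)$, with the gluon momentum $k$ strictly collinear to $p$. The helper names, the metric signature, and the small virtuality used to regulate $p^2 = 0$ are choices made for this illustration only.

```python
import numpy as np

# Minkowski metric g_{mu nu} with signature (+, -, -, -); an assumption of this sketch.
G = np.diag([1.0, -1.0, -1.0, -1.0])

def lower(v):
    """Lower the index of a contravariant four-vector."""
    return G @ v

def dot(a, b):
    """Minkowski product a.b of two contravariant four-vectors."""
    return float(a @ G @ b)

def covariant_numerator(k, xi):
    """Covariant-gauge numerator -g_{mu nu} + (1 - xi) k_mu k_nu / k^2 (lower indices)."""
    k_low = lower(k)
    return -G + (1.0 - xi) * np.outer(k_low, k_low) / dot(k, k)

def light_cone_numerator(k, n):
    """Light-cone-gauge numerator -g_{mu nu} + (k_mu n_nu + n_mu k_nu) / (n.k)."""
    k_low, n_low = lower(k), lower(n)
    return -G + (np.outer(k_low, n_low) + np.outer(n_low, k_low)) / dot(n, k)

def nearly_lightlike(pz, p2):
    """Momentum along the 3-direction with virtuality p^2 = p2 (regulates p^2 = 0)."""
    return np.array([np.sqrt(pz * pz + p2), 0.0, 0.0, pz])

if __name__ == "__main__":
    xi = 0.7
    n = np.array([1.0, 0.0, 0.0, -1.0])  # light-like gauge vector with n.p != 0

    p = nearly_lightlike(10.0, 1.0)
    k = 0.3 * p                           # strictly collinear gluon momentum
    # Covariant gauge: contracting with p^mu leaves -xi * p_nu.
    print(p @ covariant_numerator(k, xi), -xi * lower(p))

    p = nearly_lightlike(10.0, 1e-8)
    # Light-cone gauge: the contraction is proportional to p^2 and vanishes on shell.
    print(p @ light_cone_numerator(0.3 * p, n))
```

The covariant-gauge contraction is independent of the regulator, while the light-cone result equals $p^2 n_\nu/(n\cdot p)$ and therefore disappears for strictly collinear, massless kinematics.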

A simple way to understand why these diagrams are favored over others comes from light-cone
dimensional analysis. Since the dimension of an amplitude is fixed, all soft mass dimensions
must be compensated by the hard scale $Q^2$. Assuming covariant normalization for the
external states ($\langle p|p\rangle = 2p^0(2\pi)^3\delta^3(0)$), every external wavefunction contributes mass dimension
$-1$. The collinear quarks and gluons in a soft hadron vertex have effective mass dimensions
depending on their polarizations. A Dirac field $\psi$ can be written as a sum of good ($\psi_+$) and
bad ($\psi_-$) components, where $\psi_\pm = P_\pm\psi$ and $P_\pm = \frac{1}{2}\gamma^\mp\gamma^\pm$. The good (bad) component has
effective light-cone mass dimension 1 (2).^{94} The vector potential has light-cone components
$A^+$, $A^\perp$, and $A^-$, which have effective mass dimensions 0, 1, and 2, respectively.^{95} For the

91 Scalar gluons can be thought of as gluons whose polarization vectors are dotted with their momenta. I.e.
the 'scalar gluon field' is written $\partial_\mu A^\mu$.

92 Like Figure 2.2c in Section 

93 In these manipulations, we take $p^2 = 0$ wherever it will not cause a divergence.

94 Since we consider collinear quarks moving in the 3-direction, the free quark wavefunction is annihilated by
$\not{p}$. This means that the 'bad' component cannot freely propagate. We will see in Chapter Q that this part of
the quark field really represents a quark-gluon composite rather than a simple quark.

95 As in the quark case, this is related to equation of motion constraints. 
Figure 2.8: A graphical statement of factorization in the axial gauge.



reduced diagram shown in Figure 2.7, the only soft mass dimension comes from the nucleon-quark-gluon
blob. Using the above rule, we find it is $\sum_i m_i - 2$. Here, $m_i$ is the light-cone mass
dimension of the $i$th collinear parton line. Our scattering process requires at least two physical
($m \neq 0$) partons, so $\sum_i m_i \geq 2$. Since physical parton lines add soft dimension to our matrix
element, they lead to suppression in our amplitudes. Only the scalar gluon field $A^+$ can be
added without penalty. Hence, we see that the most general leading pinched collinear surface
is represented by Fig. 2.7, with two parton lines with $m = 1$ and any number of scalar gluons
connecting the nucleon blob to the hard scattering. Noncollinear lines cannot classically reach
the hard interaction, so the only other possibility for a pinched singularity involves soft lines.
Since zero is in some sense collinear to everything, these lines are in principle contained in
Figure 2.7. However, we must consider them explicitly as they can lead to divergences that
will not factorize into our parton distributions.
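
The counting rule can be written out as a small bookkeeping routine. This is only a sketch of the rule as stated in the text (soft dimension $\sum_i m_i - 2$, leading behavior when it vanishes and at least two physical partons are present); the dictionary keys and function names are hypothetical labels introduced here.

```python
# Effective light-cone mass dimensions quoted in the text.
LIGHT_CONE_DIMENSION = {
    "good_quark": 1,  # psi_+
    "bad_quark": 2,   # psi_-
    "A_plus": 0,      # scalar gluon component A^+
    "A_perp": 1,      # transverse gluon component
    "A_minus": 2,     # A^- component
}

def soft_dimension(lines):
    """Soft mass dimension sum_i m_i - 2 of the nucleon blob for the given collinear lines."""
    return sum(LIGHT_CONE_DIMENSION[name] for name in lines) - 2

def is_leading(lines):
    """True if the lines include at least two physical (m != 0) partons and cost no power of Q."""
    physical = [name for name in lines if LIGHT_CONE_DIMENSION[name] != 0]
    return len(physical) >= 2 and soft_dimension(lines) == 0

if __name__ == "__main__":
    print(is_leading(["good_quark", "good_quark"]))                       # two physical quarks
    print(is_leading(["good_quark", "good_quark", "A_plus", "A_plus"]))   # scalar gluons are free
    print(is_leading(["good_quark", "good_quark", "A_perp"]))             # extra physical gluon
```

Adding any number of `A_plus` entries leaves the soft dimension unchanged, which is the statement that scalar gluons can be attached without penalty.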

The power counting involving soft quark and gluon lines is more subtle, and some discussion 
may be found in Ref. (5(| . Essentially, one applies a general theorem due to Kinoshita, Lee and 
Nauenberg (KLN) j5i| which states that infrared divergences cancel in the sum over initial and 
final states. According to the above arguments, soft initial state interactions are suppressed 
by Q 2 while collinear ones only serve to enforce the gauge-invariance of our process. Hence 
our sum over final state interactions only is enough to cancel the leading soft divergences. 
Because of this cancelation, we can consider any reduced diagram with soft lines connecting 
the hard scattering blob to the nucleon jet subleading. The situation here is discussed in detail
in Sterman [60].



In the light-cone gauge, where collinear scalar gluons do not exist, the jump from this analysis
to a factorization theorem is in some sense obvious. Here, $\Psi = \psi$ and only those diagrams
with two physical partons connecting the hard and soft physics contribute at leading order. In
this case, factorization is represented graphically by Figure 2.8. However, the complications
associated with renormalization in light-cone gauge^{96} make it desirable to consider covariant
gauges. In these gauges, factorization follows from the Ward-Takahashi identities introduced
in Appendix D.2. Diagrammatically, these identities tell us that the sum of all insertions of a
scalar gluon is zero. This is true for all physical matrix elements of gauge-invariant operators.
In particular, we can apply it both to the hard scattering kernel and to the eikonal line. A
graphical illustration of this application is shown in Figures 2.9a and b. Since the right-hand
sides of these graphical equations are identical, we are free to make the identification in 2.9c.
Physically, the scalar gluons are coherently absorbed by the hard jet, which is represented by
an eikonal line. They do not possess the immense amounts of energy necessary to resolve the
individual constituents of the jet, so only the total color charge and momentum are relevant.
This kind of coherent absorption cannot affect the hard scattering in a nontrivial way; its only
effect is to induce gauge invariance in the final result. As we saw in the last section, this is
exactly the effect of the gauge link in our parton distributions. Seen from this angle, it is
not surprising that the scalar gluons generate our distributions. The proof to all orders is
by induction; applying the Ward identities iteratively, one easily arrives at the factorization
theorem expressed in Equation (2.117).



2.5 The Operator Product Expansion 

In the last three sections, we have shown that the hard and soft physics of the amplitude^{97}

\[ T^{\mu\nu} = i\int d^4z\, e^{-iq\cdot z}\,\langle PS|\,\mathrm{T}\,J^\mu(0)J^\nu(z)\,|PS\rangle \tag{2.173} \]

can be consistently separated from each other if the virtuality $Q^2 = -q^2 \gg M\Lambda_{QCD}$. Furthermore,
we have argued that this separation is independent of the target. Since the target
dependence of this amplitude is contained entirely within the external states, it is natural to

96 See Section 1.6.

97 The difference between this expression and that in Eq. (2.22) is purely aesthetic, as can be shown immediately
by shifting the argument and changing $z \to -z$.
Figure 2.9: A graphical illustration of the use of Ward identities to show that scalar gluons
generate the gauge link in our parton distribution functions. Dashed lines represent the
propagation of a scalar gluon.
ask whether or not we can strip these states away and express factorization as an operator
relation. This idea was first introduced by Wilson in 1969 [59] and has been used extensively
in deep-inelastic scattering and other perturbative QCD processes. The fundamental idea is
that for large $Q^2$ the integral in (2.173) has support only in the region of small $z$. This means
that our amplitude receives contributions only from regions in which the operator product
$J^\mu J^\nu$ is nearly local, which suggests a kind of Taylor expansion of the operator $J^\nu(z)$ about
$z = 0$. In this way, we express the operator

\[ i\int d^4z\, e^{-iq\cdot z}\,\mathrm{T}\,J^\mu(0)J^\nu(z) \tag{2.174} \]

as an infinite series, known as an Operator Product Expansion (OPE), of local operators.
For the product of two currents separated near the light-cone, the expansion is threefold.
Primarily, it is a twist expansion, in which twist-2 contributions are leading whereas the
higher twist terms are suppressed by powers of $M/Q$. Each term in the twist expansion
contains an infinite number of local operators of the relevant twist. Finally, the coefficients of
these operators (Wilson coefficients) are themselves expansions in the strong coupling constant.



In this section, I will show how we can convert the results of Section 2.3 into next-to-leading
order results for the Wilson coefficients of the OPE relevant to DIS. The Wilson coefficients
for the unpolarized DIS process were first calculated at order $\alpha_s$ in the MS scheme in [61]. For
the polarized case, one can find them in [62].



To get a better handle on the meaning of the OPE, it is useful to construct the leading
order Wilson coefficients directly from the operator, Eq. (2.174). Since we take $z$ near zero,
we will be concerned only with the most singular terms in this limit. By inspection, one can
see that singular behavior occurs only when we contract two quark fields from the different
currents together to form a propagator.^{98} For $z^0 < 0$, we have

i J d A z<d{-z°)e- lq - z J^{Q)J u (z) 

= i J d 4 ze(-z°)e- l ^| ^e ifc -'£ ^,(0)7^7^,0*) 



98 This implies that we need not consider terms in the operator product that couple different flavors.
E ^(0)7^^7^(0) • (2.175)



10+ 

q 



In the last line, I have integrated by parts^{99} (ignoring a surface term) and performed the
integrals over $z$ and $k$. The result for $z^0 > 0$ is identical to this one, but with $q \to -q$ and
$\mu \leftrightarrow \nu$.^{100}



These results are very similar to those obtained in Section 2.2, as they must be. We can 
get those results simply by tacking on external states and allowing the leftover quark fields to 
contract with them. Alternatively, using 

i>+4 q 2 q- J 

we can turn this expression into the expansion 

(2q-id-d 2 



i f d 4 ze-^TJ^0)r(z) = ^Te^E 



r- 



)2 



n=0 

2 \ n 

" % (2-177) 



v i$-j ( -1q-id-d 

7 Q 2 \ Q 2 1 



involving only local operators. The operator $\partial^2$ cannot hope to compete with $Q^2$ in the Bjorken
limit unless we intend to consider external states with incredible amounts of ambient energy,
so we can safely neglect this term. On the other hand, if $q$ chooses its orientation wisely,
$q\cdot i\partial$ can survive this limit.
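
The expansion in powers of derivatives over $Q^2$ is a geometric series, and its convergence can be illustrated with ordinary numbers. The sketch below assumes that, acting on a quark field of momentum $k$, the operator $i\partial$ may be replaced by $k$ (so that $-\partial^2$ becomes $k^2$), which gives $1/(k+q)^2 = -\frac{1}{Q^2}\sum_n \left[(2q\cdot k + k^2)/Q^2\right]^n$. The function names and sample values are illustrative only.

```python
def exact_denominator(q2, q_dot_k, k2):
    """1/(k + q)^2 for a spacelike photon, q^2 = -Q^2 < 0."""
    return 1.0 / (k2 + 2.0 * q_dot_k + q2)

def ope_series(q2, q_dot_k, k2, terms):
    """Partial sum of -(1/Q^2) * sum_n ((2 q.k + k^2) / Q^2)^n."""
    big_q2 = -q2
    ratio = (2.0 * q_dot_k + k2) / big_q2
    return -sum(ratio ** n for n in range(terms)) / big_q2

if __name__ == "__main__":
    q2, q_dot_k, k2 = -100.0, 10.0, 0.5
    for terms in (1, 2, 5, 20):
        # The partial sums approach the exact value as long as |2 q.k + k^2| < Q^2.
        print(terms, ope_series(q2, q_dot_k, k2, terms), exact_denominator(q2, q_dot_k, k2))
```

The series converges only while the ratio is smaller than one in magnitude, which is the numerical counterpart of the statement that the expansion is valid in the unphysical region.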

Simplifying the Dirac structure and cleaning house a little, we arrive at 

i J d 4 ze- iq - z TJ"{0) J v {z) 



xf^^ q {id a7f3 +id 0la )(^^\ V 9 (2.178) 



99 The action of the derivative on the $\Theta$-function generates a contact term which can be subtracted by
considering a modified time-ordering operation, T*. This technical point is discussed in Ref. [63]. Here, we
will assume that the appropriate modifications have been made and ignore this term.

100 In order to show this, we have to use the fact that we consider only diagonal matrix elements. As full-fledged
operators, these objects differ by a total derivative.
71=1

2 



(^ + *)^7 5 (^)%. (2-180) 
Q a ^ «\ Q 2 Q 2 ) 



Y^^ q {id alp -id pla )(^X ^ q (2.181) 



x 

oo 



-A^E^E'v^^)"^ ■ (2 - 182) 

Here, a primed summation indicates that only even or odd numbers are summed over, depending
on whether the sum starts with 0 or 1.

Contracting with $q_\mu$, we immediately see that the first two terms automatically conserve
current. However, the last three operators do not satisfy the Ward identity. To see why, let us
imagine taking diagonal hadronic matrix elements of our expression. Since the operators on
the right-hand side do not have any intrinsic dependence on $q^\mu$, Lorentz invariance requires all
of their vector indices to be carried by the hadronic momentum and all of their axial indices
to be carried by the spin. This implies that (2.182) vanishes by the equation of motion,^{101}
and (2.181) does so identically. In order to have some other behavior, we need to introduce
another momentum into the problem, an action that invariably leads to other contributions
which will combine with these to restore gauge invariance. The third term, (2.180), retains
the nonvanishing gauge dependent piece



c 



P a S (2.183) 



after we have taken our matrix element. In the Bjorken limit, this tensor must vanish since S 
is parallel to P. Although it's more difficult to see, any deviation from this limit will imply 
other contributions to the amplitude. We will see in Chapter [| that these contributions do 
indeed cancel this gauge dependence. For now, let us consider only leading contributions and 
take this object as zero. 

101 which, to the order at which we work, is simply $i\not{\partial}\psi = 0$.
Now that we have taken care of electromagnetic gauge invariance, it behooves us to consider
chromodynamic gauge invariance. As it stands, none of the operators on the right-hand side of
our expansion are gauge-invariant. The reason for this is very simple: we are working only to
leading order in QCD. The culprit is our replacement of $\psi(0)\bar\psi(z)$ with the Dirac propagator.
To maintain explicit gauge invariance without calculating to all orders, we can introduce a
gauge link between the two remaining fields simultaneously with the replacement. The effect
of this procedure is a simple replacement of all partial derivatives with covariant derivatives.

Stripping away the factors of $q^\mu$, we see that our expansion involves local operators of the
form

\[ \bar\psi_q\,\gamma^{\mu_1}\, iD^{\mu_2}\cdots iD^{\mu_n}\,\psi_q . \tag{2.184} \]



As explained in Appendix B.3, n-index tensors such as these can be separated into several
disjoint pieces. Each of these pieces transforms separately from the others under the action
of the Lorentz group. As a symmetry of our theory, Lorentz covariance restricts the possible
divergences that can appear. For this reason, operators which begin in the same irreducible
representation of a symmetry group will remain together in that representation after renormalization.
However, Lorentz invariance cannot impose relations between two different representations.
The operators that appear in our expansion actually represent sums of operators
in different representations, all of which renormalize independently of one another. Rather
than this mess, we would prefer to separate our operators into groups which stay together
after renormalization. The procedure for this separation is straightforward and outlined in the
appendix. Here, we mention only that the tensors required for our purpose are those with the
highest spin. All other structures are subleading.^{102} In view of this, we define

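\[ {}_qO^{\mu_1\mu_2\cdots\mu_n} = \bar\psi_q\,\gamma^{(\mu_1} iD^{\mu_2}\cdots iD^{\mu_n)}\,\psi_q \tag{2.185} \]
\[ {}_q\tilde{O}^{\mu_1\mu_2\cdots\mu_n} = \bar\psi_q\,\gamma^{(\mu_1} iD^{\mu_2}\cdots iD^{\mu_n)}\gamma_5\,\psi_q . \tag{2.186} \]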

In terms of these operators, we can write our expansion as 

2 ^„2^'(2^x)( 2 ?w)---(29Mn) 



(Q 2 ) n 

n=0 V ^ ; 



102 This can be seen simply by taking matrix elements. The hadron's momentum and helicity cannot support
any antisymmetric tensors and the traces all lead to powers of $M^2$.
2 «



Q 2 



'/3 



-2 ^ [ <T 



l 0£ 
Q 2 

q^q v 



(2.187) 



Q 2 

This is the desired expansion of the product of two electromagnetic currents in terms of
an infinite series of local operators. Each operator appears with a power of $Q$ to compensate
its mass-dimension and a power of $q^\alpha/Q$ for most of its tensor indices. Removing trivial
kinematical factors,^{103} a general operator of spin $s$ and mass-dimension $m$ appears in our
expansion as^{104}
\[ \frac{(2q_{\mu_1})(2q_{\mu_2})\cdots(2q_{\mu_s})}{(Q^2)^s}\,\frac{1}{Q^{m-s-2}}\;O^{\mu_1\mu_2\cdots\mu_s} . \tag{2.188} \]

Taking matrix elements with momentum $P$ such that $P\cdot q = \nu$, we see that the contribution
of this operator is suppressed by $Q^{m-s-2}$. Since it governs the suppression of these operators,
the twist

\[ t = m - s \tag{2.189} \]

is of great importance. A look at Equations (2.185) and (2.186) tells us that the twist of the
operators in our leading expansion is $(n-1) + 3 - n = 2$. Hence these operators are not
suppressed in the Bjorken limit.
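
The twist bookkeeping is simple enough to spell out in a few lines. This is a minimal sketch of the counting described above, assuming four spacetime dimensions, quark fields of mass dimension 3/2, and covariant derivatives of mass dimension 1; the function names are introduced here for illustration.

```python
def quark_operator_dimension(n):
    """Mass dimension of a quark bilinear with one gamma matrix and n - 1 covariant derivatives."""
    # Two quark fields contribute 3/2 each; every covariant derivative contributes 1.
    return 3 + (n - 1)

def twist(mass_dimension, spin):
    """Twist t = m - s, Eq. (2.189)."""
    return mass_dimension - spin

def suppression_power(mass_dimension, spin):
    """Power of 1/Q accompanying an operator of the given dimension and spin: m - s - 2."""
    return twist(mass_dimension, spin) - 2

if __name__ == "__main__":
    for n in range(1, 6):
        m = quark_operator_dimension(n)
        # Every totally symmetric spin-n operator of this family has twist 2.
        print(n, m, twist(m, n), suppression_power(m, n))
```

For every `n` the twist is 2 and the suppression power is 0, which is the statement that this tower of operators survives the Bjorken limit.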

As mentioned above, operators of different spin cannot mix under renormalization due to
the Lorentz invariance of our theory. Likewise, the absence of a scale^{105} in massless QCD
prevents operators with different mass-dimension from mixing with one another. This implies
that only operators of the same twist can mix. An exhaustive classification of QCD operators
in terms of twist [64] reveals that the lowest possible twist is twist-2. Furthermore, the only



103 This amounts to ignoring each index that is 'bogged down' in a tensor structure involving $g^{\mu\nu}$, and along
with it a power of $Q$. In addition, factors of $q^\alpha$ adorning these tensor structures should also be ignored along
with a corresponding factor of $Q$. By taking matrix elements and comparing with (2.28), one can see that this
is the proper procedure to determine the suppression of the structure function itself.

104 This is purely a dimensional argument. Note that the mass-dimension of our operator $\int T JJ$ is 2.

105 The scale generated through the coupling by dimensional transmutation is of no help here. Its effect is
limited to producing scale variations; $\mu^2$ can appear only in a ratio.
other operators of twist-2 in the nonsinglet sector,



(2.190) 



must completely decouple from our operators in massless QCD because of their chiral-odd 
structure. 
Since the operators (2.185) and (2.186) have different transformation properties
under parity, they cannot mix with each other. This leads us to the result that all of our
operators must have diagonal matrix elements which evolve multiplicatively.^{108} Hence if we
can extract the value of one of these matrix elements at any scale, we automatically know it
at all (perturbative) scales. This represents a tremendous simplification from the results of
Section 2.3, where we found that knowledge of the entire parton distribution at one scale is
necessary to evolve any part of it to another. Unfortunately, we will see that this simplification
is not devoid of distasteful attributes.



To see the relation between our expansion (2.187) and the structure functions of (2.28),
we must take forward matrix elements. Using Lorentz symmetry, we can express the matrix
elements of ${}_qO$ and ${}_q\tilde{O}$ in terms of pure numbers:



P 



P) = AalMS^ 1 ■ •■P* 4 " 



In this way, we arrive at 



^ 2 H 4 E' (eW) 

^- a §(?*)(u)' 



(2.191) 
(2.192) 

(2.193) 
(2.194) 
(2.195) 



These equations express a direct relation between the structure functions of the Compton 
amplitude and the matrix elements of local QCD operators. In particular, barring belligerent 

106 Here, we consider only operators with nonzero forward matrix elements. We will meet other twist-2 
operators in the next chapter. 

107 These operators relate left- and right-handed quarks. As such, they cannot contribute perturbatively in
massless QCD.

108 Once gluons are introduced, we will see that the flavor singlet quark operators mix with the gluonic operators.
behavior on the part of $a_n$ and $\tilde{a}_n$, they tell us that our structure functions are analytic
functions of $x_B$ near $x_B = \infty$. In addition, the hermiticity of ${}_qO$ and ${}_q\tilde{O}$ implies that our
structure functions are real for $x_B$ on the real axis. This means that the true measured
structure functions, $F_1$, $F_2$, and $G_1$, can only be nonzero in nonanalytic regions of the $x_B$-plane.
In these regions, branch cuts can allow discontinuities across the real axis which generate
an imaginary part. This fact implies that our operator product expansion cannot converge
in the physical region. However, it can be used as a powerful tool to study the analytic
structure of the Compton amplitude. Moreover, properties such as the Callan-Gross relation,
$T_2(\nu,Q^2) = 2x_B T_1(\nu,Q^2)$, can be studied more easily from this vantage.

To go further, we must find some way to relate the scalar matrix elements to one-another. 



This is done following a hint from Section 2.2. There, we found that the integral of x times 
a structure function is related to the matrix element of a local operator appearing in Q^. 
Generalizing this procedure, we write 

dxx 11 ' 1 f q / T (x) 

_ 1 
~~ 2 
1 

~~ 2 
_ 1 
~~ 2 
1 



dx—e iXx 






2tt 






d 






2tt \i 


dX 


dx^e iXx 






2tt 





(PS\ij(0)g n (0,\n)?iiP(\n)\PS) 



PS 



= -(PS 



ip(0)G n (0, Xn)ji (n ■ iP(An))"" 1 V(An) 
PS 



PS 



(2.196) 

where we have ignored a surface term. Performing the same manipulations for the polarized
distribution and comparing with (2.191) and (2.192), we see that

dxx n - 1 f q/T (x) = al (2.197) 
dxx n ~ l f q/T {x) =a« . (2.198) 

Integrals of this form are called moments. Mathematically, the correspondence between
$a_n$ and $f_{q/T}(x)$ is contained in Mellin transformation. Substituting these expressions into (2.193),
(2.194), and (2.195), switching the order of summation and integration,^{109} and re-summing,

109 I refuse to justify this mathematically.
we arrive at

Ti{v, Q 2 ) = ( — — _JL_ V e 2 q f q/T (x) + (x B - -x B ) (2.199) 
J x xb xb—x H 

T 2 (i/, Q 2 )= I— 2x 2 — V e\ f q/T (x) - (x B - -x B ) (2.200) 
J a; xb — x — ' " 

Si(^,Q 2 ) = / —a; — V^/^W-^^-xb) . (2.201) 

These expressions are familiar from Section [T^, as they should be. At this point, we are free 
to take imaginary parts and obtain the physical structure functions^{110}

FMQ 2 ) = \Y J e l[fi/T^B) + f- q /T{xB)\ (2.202) 

9 

F 2 (v,Q 2 ) = x B J2 e2 q [fM x B) + f7 } / T (xB)] (2.203) 

9 

GMQ 1 ) = lY. e %/T(xB)-k/T(x B )] ■ (2.204) 

9 

Note that we have introduced the antiquarks. Their inclusion does not significantly alter the 
derivation above. 



Since we have only considered the leading order in $\alpha_s$, Equation (2.187) is guaranteed to
have QCD corrections. A calculation in the spirit of the above derivation would invariably
lead to arthritis. Alternatively, we can turn the above analysis on its head and use the results
of Section 2.3 to obtain these corrections. Our amplitudes have the form

f dr. 

In the unphysical region $x_B \gg x$, we write



tt> J x \ XB 



(2.205) 



(2.206) 



Substituting our expressions for the moments, we obtain an expansion analogous to (2.193). 
Writing 

(2g Ml )(2g Ma )---(2g M „) JJMlJJM2 p „ n 



1 

x B 



(Q 



2\n 



(2.207) 



110 The mathematical indiscretion above generated these imaginary parts. It can be justified in the unphysical 
region where the expansion converges. Once this has been done, the re-summed unphysical amplitudes can be 
analytically continued to the physical region. 
and interpreting the resulting expressions as matrix elements of the local operators (2.185) and
(2.186), we can strip away the matrix elements and arrive at the operator product expansion^{111}



i / d 4 ze iq - z TJ"(z)J"(0) 



q»q v 



E 



' (2? M )---(2^J 



71=2 



(Q 2 ) n 

E 2 q 
e <+n 



2 w (2g„ 2 )---(2g M „) 

n=l 



(Q 2 ) 



2^-1 



. ? \ q / 



(2.208) 



I have ignored the longitudinal contribution, although it appears at the twist-2 level, because
it is kinematically suppressed. The gluonic operators, defined by



(2.209) 
(2.210) 



are related to the gluon distributions in exactly the same way as their quark counterparts.
These operators are also twist-2 and mix with the singlet quark operators, as we will see
below. The coefficients of the expansion are given by^{112}

cx s Cf 



1 



4tt 



a s CF 

47T 



9-~ 



n n + 1 



+ 4S 2 (n-l)-4T 1 1 (n-l) 



S l{ n -I) j 3 + - + ^- 
n n + 1 



+ - + 45 2 (n - 1) - 4T 1 1 (n - 1) 

n + 1 n 

/ 9 2 
5i(n-l) 3 + -+ " 



„5 _ 



a s T F 
2tt 

a s TF 
2tt 



-25i(n-l) 



rt n + 1 
1 2 



?i + 2 
[2 + 251(71- 1)] 



n n + 1 n + 2 
2 



n Ti + l 



(2.211) 

(2.212) 
(2.213) 
(2.214) 



111 The factor $e_q^2$ does not appear as the quark coefficient to all orders. At higher orders, it is only the singlet
and nonsinglet distributions which make sense. For these distributions, one can factor out an electromagnetic
charge factor to all orders. Since the distinction is irrelevant at this order, I have not made it.

112 These coefficients do not agree with those in [[3 because their choice of $\gamma_5$ scheme is different from mine.
where I have introduced



Thn) 



i=l 



(2.215) 



These coefficients can be obtained simply by expanding the hard scattering kernels about
$x_B = \infty$. Note the implication that the kernels are finite in this limit. This is one of the
requirements of a local operator product expansion.

Since the moments of the distribution functions are mathematically equivalent to the functions
themselves,^{113} it is interesting to study their evolution. According to (2.126), we have

^^<(^ 2 ) = ^^lljy z ^ 1 ^a{z)a:;( j ?) . (2.216) 

Hence the evolution of the moments is governed by the anomalous dimension 



la 



dzz n - L V a ^ a (z) 



In the nonsinglet sector, we have the simple result 

1 



Ins = c f 



n(n + 1) 



201 (n) 



2 \ a >7»s/ 2 " 



(2.217) 

(2.218) 
(2.219) 



< 8 (A) ■ 

This explains why $\gamma$ is called an anomalous dimension. $a_n$ is dimensionless, but it varies with
scale as though it had dimension $\alpha_s\gamma^n_{NS}/\pi$.^{114} $\gamma$ is related to the renormalization constant
required to remove its ultraviolet divergences. One can think of it as the residue left over by

113 The Legendre polynomials are finite linear combinations of $\{x^i\}$, so we can always trade Mellin moments 
for Legendre moments. Since the latter form a complete basis of $L^2$ on the interval $[-1, 1]$, we can fix the 
distribution functions in $L^2$. To do better, we determine endpoint contributions from the conservation requirements, 



Eqs. fl2.f30[ ) and ( gT50| ). 

114 If we take the leading variation of $\alpha_s(\mu^2)$ with $\mu^2$ into account, the factor 



a»7S s /2vr 



a s (p. 2 ) 



(2.220) 



However, since we only use the leading value of $\gamma^n_{ns}$, this represents only a partial resummation of higher order 
effects. This so-called 'leading log' approximation is used extensively in fits to data. In any case, the two 
factors coincide in the limit of small $\alpha_s$. 
renormalization. Note that $\gamma^n_{ns} \le 0$ for all values of $n$. This implies that as the scale is raised, 
fewer and fewer of the operators in our expansion are important. It reflects the fact that as we look 
closer and closer at a parton distribution, more and more partons appear. The more partons 
there are to share the total momentum of our target, the less momentum each one has. Hence 
as the scale is increased, the parton distributions become sharply peaked at the origin. Since 
each higher moment penalizes small-$x$ contributions more than the last, it sees less of the bulk 
of the distribution. One more important aspect of our result is that $\gamma^1_{ns} = 0$. Since the first 
moment of the quark distribution is conserved, its anomalous dimension is constrained to be 
zero. 
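
As a quick numerical illustration of these statements, the following sketch evaluates the leading-order nonsinglet anomalous dimension for the first few moments and applies the fixed-coupling solution for the moments. It assumes the standard form $\gamma^n_{ns} = C_F[3/2 + 1/(n(n+1)) - 2S_1(n)]$ with $S_1(n) = \sum_{i=1}^n 1/i$ and the SU(3) value $C_F = 4/3$. The function names and the sample value of $\alpha_s$ are illustrative choices, not taken from the text.

```python
from fractions import Fraction
import math

C_F = Fraction(4, 3)  # assumed: SU(3) value of the fundamental Casimir


def S1(n):
    """Harmonic sum S_1(n) = sum_{i=1}^{n} 1/i, kept as an exact rational."""
    return sum(Fraction(1, i) for i in range(1, n + 1))


def gamma_ns(n):
    """Leading-order nonsinglet anomalous dimension of the n-th moment."""
    return C_F * (Fraction(3, 2) + Fraction(1, n * (n + 1)) - 2 * S1(n))


def evolve_moment(a0, n, mu2, mu0_2, alpha_s):
    """Fixed-coupling evolution a(mu^2) = (mu^2/mu0^2)^(alpha_s*gamma/2pi) * a(mu0^2)."""
    exponent = alpha_s * float(gamma_ns(n)) / (2.0 * math.pi)
    return a0 * (mu2 / mu0_2) ** exponent


if __name__ == "__main__":
    for n in range(1, 6):
        # alpha_s = 0.2 is a hypothetical value used only for illustration
        ratio = evolve_moment(1.0, n, 100.0, 10.0, alpha_s=0.2)
        print(f"n={n}  gamma_ns={gamma_ns(n)}  a(100)/a(10)={ratio:.4f}")
```

The first moment comes out with vanishing anomalous dimension, and every higher moment is suppressed more strongly than the one before it as the scale is raised.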

In the singlet sector, the quark and gluon operators mix with each other. Mathematically, 
this is because they contain divergences that they can't absorb by themselves. Physically, it is 
because 'quark' and 'gluon' are not good quantum numbers in this context. If we want some- 
thing to evolve independently of other things, it had better be an eigenstate of the evolution 
operator. Since the gluon and singlet quark fields fluctuate into one another, it doesn't really 
make sense to separate them in this way. To find out what does make sense, one can calculate 
the anomalous dimension matrix, 

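$$\gamma^n = \begin{pmatrix} \gamma^n_s & \gamma^n_{sg} \\ \gamma^n_{gs} & \gamma^n_g \end{pmatrix} \; , \tag{2.221}$$

where 

$$\gamma^n_s = C_F\left[\frac{3}{2} + \frac{1}{n(n+1)} - 2S_1(n)\right] \tag{2.222}$$

$$\gamma^n_{sg} = 2n_fT_F\left[\frac{1}{n} - \frac{2}{(n+1)(n+2)}\right] \tag{2.223}$$

$$\gamma^n_{gs} = C_F\left[\frac{1}{n+1} + \frac{2}{n(n-1)}\right] \tag{2.224}$$

$$\gamma^n_g = 2C_A\left[\frac{1}{n(n-1)} + \frac{1}{(n+1)(n+2)} - S_1(n)\right] - \frac{4n_fT_F - 11C_A}{6} \; , \tag{2.225}$$
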
and diagonalize it. The eigenstates of this matrix evolve independently of one another. If we 
now invert the Mellin transform, we will find ourselves with two singlet distributions which 
make sense (at this order) independently of one another. In particular, the evolution of the 
second moment is governed by the matrix 



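$$\gamma^2 = \begin{pmatrix} -\frac{4}{3}C_F & \frac{2}{3}n_fT_F \\ \frac{4}{3}C_F & -\frac{2}{3}n_fT_F \end{pmatrix} \; . \tag{2.226}$$

This matrix has the eigenvalues 

$$\gamma^2_{1,2} = 0 \; , \; -\frac{2}{3}\left(2C_F + n_fT_F\right) \; . \tag{2.227}$$
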

Since the second eigenvalue is negative, the associated eigenvector will be 'run into the ground' 
as the scale is increased and the only relevant combination becomes that associated with 
the eigenvalue zero. Since the second moment of our distributions represents the fraction 
of + momentum carried by quarks and gluons, this eigenvector tells us these fractions at 
asymptotically large momentum scales : 

$$P_q(\infty) = \frac{2N_cn_fT_F}{2(N_c^2 - 1) + 2N_cn_fT_F} = \frac{3n_f}{16 + 3n_f} \tag{2.228}$$

$$P_g(\infty) = \frac{2(N_c^2 - 1)}{2(N_c^2 - 1) + 2N_cn_fT_F} = \frac{16}{16 + 3n_f} \; . \tag{2.229}$$


This result was first discovered by D. Gross and F. Wilczek in 1974 |6jJ. Since there are 
$(N_c^2 - 1)$ gluons each with two physical polarization states and $N_cn_f$ quarks each with two 
polarizations and 'strength' $T_F$, this result has the simple interpretation of equal momentum 
sharing among all of the relevant degrees of freedom. Corrections to this result vanish as 

$$\left(\frac{\mu^2}{\mu_0^2}\right)^{-(16/9 + n_f/3)\,\alpha_s/2\pi} \tag{2.230}$$

as the scale is increased. 
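
The asymptotic momentum fractions can be checked with exact rational arithmetic. The sketch below assumes the standard leading-order unpolarized singlet anomalous dimensions in the normalization used here (evolution driven by $\alpha_s\gamma^n/2\pi$), with the SU(3) values $C_F = 4/3$, $C_A = 3$ and $T_F = 1/2$. The function names are illustrative and not part of the text.

```python
from fractions import Fraction

# Assumed SU(3) color factors
C_F, C_A, T_F = Fraction(4, 3), Fraction(3), Fraction(1, 2)


def S1(n):
    """Harmonic sum S_1(n) = sum_{i=1}^{n} 1/i."""
    return sum(Fraction(1, i) for i in range(1, n + 1))


def singlet_matrix(n, n_f):
    """Unpolarized singlet matrix ((g_s, g_sg), (g_gs, g_g)); requires n >= 2."""
    if n < 2:
        raise ValueError("the unpolarized singlet matrix diverges at n = 1")
    g_s = C_F * (Fraction(3, 2) + Fraction(1, n * (n + 1)) - 2 * S1(n))
    g_sg = 2 * n_f * T_F * (Fraction(1, n) - Fraction(2, (n + 1) * (n + 2)))
    g_gs = C_F * (Fraction(1, n + 1) + Fraction(2, n * (n - 1)))
    g_g = (2 * C_A * (Fraction(1, n * (n - 1)) + Fraction(1, (n + 1) * (n + 2)) - S1(n))
           - (4 * n_f * T_F - 11 * C_A) / 6)
    return ((g_s, g_sg), (g_gs, g_g))


def asymptotic_momentum_fractions(n_f):
    """Quark and gluon momentum fractions from the zero mode of the n = 2 matrix."""
    (g_s, g_sg), (g_gs, g_g) = singlet_matrix(2, n_f)
    # Momentum conservation: each column sums to zero, so one eigenvalue vanishes.
    assert g_s + g_gs == 0 and g_sg + g_g == 0
    # The zero-mode eigenvector (q, g) satisfies g_s*q + g_sg*g = 0.
    q, g = g_sg, -g_s
    return q / (q + g), g / (q + g)


def nonzero_eigenvalue(n_f):
    """Second eigenvalue of the n = 2 matrix (equal to the trace, since the other is zero)."""
    (g_s, _), (_, g_g) = singlet_matrix(2, n_f)
    return g_s + g_g


if __name__ == "__main__":
    for n_f in (3, 4, 5, 6):
        print(n_f, asymptotic_momentum_fractions(n_f), nonzero_eigenvalue(n_f))
```

Building the second-moment matrix from the general-$n$ expressions, rather than hard-coding it, also makes the vanishing column sums an explicit check of momentum conservation.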
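The polarized matrix, 

$$\tilde\gamma^n = \begin{pmatrix} \tilde\gamma^n_s & \tilde\gamma^n_{sg} \\ \tilde\gamma^n_{gs} & \tilde\gamma^n_g \end{pmatrix} \; , \tag{2.231}$$

$$\tilde\gamma^n_s = \gamma^n_{ns} \tag{2.232}$$

$$\tilde\gamma^n_{sg} = 2n_fT_F\left[\frac{2}{n+1} - \frac{1}{n}\right] \tag{2.233}$$

$$\tilde\gamma^n_{gs} = C_F\left[\frac{2}{n} - \frac{1}{n+1}\right] \tag{2.234}$$

$$\tilde\gamma^n_g = 2C_A\left[\frac{2}{n(n+1)} - S_1(n)\right] - \frac{4n_fT_F - 11C_A}{6} \; , \tag{2.235}$$

is quite unrelated to $\gamma^n$. This is because the separate helicity of the gluon and singlet quark 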
fields also does not make sense by itself; only the combination is important. Since this is in a 
different sector of the theory, there is no reason that the two eigenstates should be related. In 
particular, the unpolarized anomalous dimension matrix in the singlet sector is divergent for 
$n = 1$ owing to the fact that the first moment of the gluon distributions is either not local¹¹⁵ 
or not gauge-invariant.¹¹⁶ However, the polarized matrix 

$$\tilde\gamma^1 = \begin{pmatrix} 0 & 0 \\ \frac{3}{2}C_F & \frac{1}{6}\left(11C_A - 4n_fT_F\right) \end{pmatrix} \tag{2.236}$$

is a singular but perfectly finite object. This matrix implies that the singlet helicity is 
conserved¹¹⁷ while the 'gluon helicity' grows with the positive anomalous dimension 

$$\tilde\gamma^1_g = \frac{11C_A - 4n_fT_F}{6} \; . \tag{2.237}$$

This number is, of course, familiar to us as the leading coefficient of the $\beta$-function. It can 
be shown that this behavior is canceled by an equally large orbital contribution in any truly 
physical quantities¹¹⁸ [ p6[ . The steady-state solution, 

$$-\frac{11C_A - 4n_fT_F}{6}\,\tilde a^1_g = \frac{3}{2}C_F\,\tilde a^1_s \; , \tag{2.239}$$

is not as useful as the unpolarized result because we have no way to normalize it short of 
experiment. 
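
For concreteness, the first-moment polarized matrix can be evaluated directly. The sketch below assumes the standard leading-order polarized anomalous dimensions in the same normalization as the unpolarized ones, with the SU(3) values $C_F = 4/3$, $C_A = 3$ and $T_F = 1/2$. The function names are illustrative and not part of the text.

```python
from fractions import Fraction

# Assumed SU(3) color factors
C_F, C_A, T_F = Fraction(4, 3), Fraction(3), Fraction(1, 2)


def S1(n):
    """Harmonic sum S_1(n) = sum_{i=1}^{n} 1/i."""
    return sum(Fraction(1, i) for i in range(1, n + 1))


def polarized_matrix(n, n_f):
    """Polarized singlet matrix ((g_s, g_sg), (g_gs, g_g)), finite for all n >= 1."""
    g_s = C_F * (Fraction(3, 2) + Fraction(1, n * (n + 1)) - 2 * S1(n))
    g_sg = 2 * n_f * T_F * (Fraction(2, n + 1) - Fraction(1, n))
    g_gs = C_F * (Fraction(2, n) - Fraction(1, n + 1))
    g_g = 2 * C_A * (Fraction(2, n * (n + 1)) - S1(n)) - (4 * n_f * T_F - 11 * C_A) / 6
    return ((g_s, g_sg), (g_gs, g_g))


def determinant(m):
    """Determinant of a 2x2 matrix given as nested tuples."""
    (a, b), (c, d) = m
    return a * d - b * c


if __name__ == "__main__":
    m1 = polarized_matrix(1, n_f=3)
    print("first-moment matrix:", m1)
    print("determinant:", determinant(m1))
```

At $n = 1$ the top row vanishes, so the matrix is singular, and the lower-right entry reduces to $(11C_A - 4n_fT_F)/6$.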

2.6 Summary of DIS 

In this chapter, we have studied one of the most important tools of modern particle physics - 
deep-inelastic scattering. In a special kinematic limit, this process can be considered incoher- 
ent; one can calculate the cross-section independently for each constituent of a complicated 
bound state and simply average over the constituents to obtain the total cross-section. This is 
due to a separation of scales between the soft physics that dominates the bound state interac- 
tions and the hard physics relevant to the virtual photon scattering. Using analytic arguments 

115 Since factors of $x$ turn into derivatives, we can think of factors of $1/x$ as turning into integrals. 

116 Instead of living with a nonlocal distribution, we could choose to cancel the $x$ by working in the light-cone 
gauge. This process leads to a local operator, but one that has no readily accessible physical interpretation. 

117 The anomaly does not make its presence known until the next order. 



118 Note that according to (2.214), the first moment of this distribution does not appear in polarized DIS. All 
higher moments have negative anomalous dimensions, as they should. 
about the generation of singularities, one can show that this factorization of hard and soft 
scales persists to all orders in perturbation theory. 

The weighting of the average over constituents is done via parton distribution functions, 
which can be shown within QCD to be related to probabilities for finding partons within our 
target hadron. Although these distributions are nonperturbative and cannot be calculated 
reliably using contemporary methods, they are process-independent and appear over and over 
again in different experiments. As such, these distributions can be measured in one experiment 
and used to predict the outcome of another as a test of the theory. Moreover, the information on 
nonperturbative hadronic wavefunctions they contain is interesting in itself. This information 
can be used to test the accuracy of models, gain insight into hadronic structure, and help make 
contact between the infinitesimal world of QCD degrees of freedom and the 'macroscopic' world 
of hadronic interactions. 

Scaling, a somewhat robust model-independent prediction of current algebra, is broken by 
the evolution of these parton distributions. Completely calculable within perturbation theory, 
violations of scaling in deep-inelastic scattering provide a very useful test of the accuracy of 
QCD which is not obscured by hadronic effects we do not understand. Experimental observa- 
tion of scaling violations agrees with the predictions of QCD to better than 10% over a span 
in $Q^2$ of four orders of magnitude and in $x_B$ of one order of magnitude. 

Expanding our amplitudes in the unphysical region of large $x_B$, we can use DIS to derive 
a local operator expansion for the product of two electromagnetic currents. This expansion 
not only allows us to study the analytic structure of the amplitude itself, but also provides a 
useful playing field on which to test various assumptions about the behavior of operators in 
a quantum field theory. Furthermore, several of the operators that appear in the expansion 
are important in their own right as representatives of flavor currents, axial currents, and the 
stress-energy tensor of QCD. We will find this aspect very useful in motivating the topic of 
the next chapter. 

As a whole, deep-inelastic scattering has made tremendous contributions to many aspects 
of modern physics. 






Chapter 3 



Deeply Virtual Compton Scattering 



In a recent paper, X. Ji introduced deeply virtual Compton scattering (DVCS) as a probe to 
a novel class of "off-forward" parton distributions (OFPD's) {$3. DVCS is a process in which 
a highly virtual photon scatters on a nucleon target (polarized or unpolarized), producing an 
exclusive final state consisting of a high-energy real photon and a slightly recoiled nucleon. 

Several interesting theoretical papers which study the DVCS process further have since 
appeared in the literature. In Ref. p7| , the single-quark scattering was recalculated using 
a different, but equivalent definition of the parton distributions. The evolution equations of 
the distributions were derived and some general aspects of factorization were discussed. In 
Ref. |^|, the evolution equations for OFPD's were derived and the leading-twist DVCS cross 
sections were calculated at order $\alpha_s^0$. Some past and recent studies of OFPD's can be found in 
. The DVCS process was considered as a limit of unequal mass Compton scattering, which 
was studied from the point of view of the operator product expansion, in Ref. |f70| . Some early 
studies of unequal mass Compton processes can be found in Refs. fn], [7^]. $O(\alpha_s)$ corrections 
to DVCS were studied in Refs. {7^, [74|, [7j| from different perspectives. 

The main motivation for the present study is to see if the theoretical basis for the DVCS 
process is up to par with other well-known perturbative QCD processes. More explicitly, we 
discuss the existence of a factorization theorem for this process. For general two virtual photon 
processes in the Bjorken limit, factorizability is suggested by the analysis of DIS in the last 
chapter. In the case of DVCS, where one of the photons is onshell, the situation could be 
different. Potential infrared problems can arise because of the additional light-like vector in 
this special kinematic limit. However, it is believed that these complications will not ruin the 
factorization properties |p8[ . 

The material presented in this chapter was originally published by X. Ji and me in p5[ . It 
is structured as follows. In the first section, I will outline one of the theoretical motivations 
for the study of DVCS. The kinematics of this process allow measurement of certain matrix 
elements which are related to the total angular momentum carried by quarks in the proton. 
Since the angular momentum carried by quark spin is measured in DIS, DVCS can help isolate 
the contribution due to quark orbital angular momentum. This quantity can give us great 



insight into the wavefunction of the proton. Section 3.2 is devoted to the kinematics of DVCS 
and the introduction of OFPD's which embody the nonperturbative physics at leading twist 
in DVCS. 

To see factorization at work, it is instructive to work out one-loop examples. We will do 



this explicitly in Section 3.3. As in Section 2.2, we consider the unphysical process of DVCS 
on onshell quark and gluon targets. For completeness we consider both the symmetric and 
antisymmetric parts of the amplitudes, which are related to helicity-independent and depen- 
dent parton distributions, respectively. The only omission is the gluon helicity flip amplitude, 
which is discussed in Ref. Mm. We will see that the collinear infrared divergences can be 



interpreted as the one-loop perturbative parton distributions in Section 3.4. This property is 
independent of the special kinematic limit of DVCS. 

A general proof of factorization in DVCS was first given by Radyushkin in his approach 



based on the $\alpha$-representation |67|. Here, we give an alternative proof using the tools introduced 
in the last chapter. According to these, the infrared sensitive contributions in a generic 



Feynman diagram can be represented by reduced diagrams. In Section 3.5, we will see that 
the leading reduced diagrams for DVCS are identical to those present in DIS. Therefore, the 
proof of factorization at this twist level is identical to that for DIS. 

The factorization properties of general two virtual photon processes can be summarized 
beautifully in terms of Wilson's operator product expansion. This expansion requires operators 
with total derivatives to describe the off-forward nature of the process ]7G, u\ \TM . In Section 



3.6, I show how we can convert our one-loop results into Wilson coefficients of the twist-2 



operators in the MS scheme. The resulting operator product expansion is valid for forward as 
well as a certain class of off-forward matrix elements. 
A summary and conclusion of this chapter appears in Section 3.7. 



3.1 Motivation 



One of the most interesting contemporary theoretical studies in nuclear physics involves the 
origin of nucleon spin. Along with the discovery that protons are composite objects comes the 
realization that their spins must somehow be generated dynamically by QCD. As described in 
the Introduction, the earliest quark models saw hadronic spin as a simple sum of the spins of 
the constituents. The three-quark structure of baryons led to spins of 1/2 and 3/2 for ground 
states, with orbital excitations providing higher half-integer spins. However, as viewed from 
QCD, these models are fundamentally flawed. They make no mention either of the sea quarks, 
which certainly must be taken into account in a full-fledged quantum field theory, or of the 
gluons, which are expected to be responsible for the quark interactions. Furthermore, they 
tacitly assume that the ground state contains no orbital excitations. While this is certainly 
true in non-relativistic quantum mechanics, there is no reason to expect it to be the case in a 
highly relativistic quantum field theory. These objections motivate us to take another look at 
the origins of nucleon spin, this time from the standpoint of QCD. 

As we have seen in the last chapter, the polarized quark density, f g >/ q (x), represents the 
difference between the probability densities for positive and negative helicity quarks carrying 
momentum fraction x. This promotes the identification 



of the first moment of the polarized quark distribution with the amount of target spin carried 
by quark helicity. We will see below that this relation is indeed correct. This contribution 
has been measured in several experiments, most notably the European Muon Collaboration 
(EMC) in 1987, with the result that quark spin accounts for only 12 ± 17% of the proton's 
spin at a scale of approximately 10 GeV$^2$.¹ This completely counterintuitive result caused the 
so-called 'spin-crisis' and put to bed any notion that the naive quark model could be used for 

1 The fit used to extract this value depends on an assumption of SU(3) flavor symmetry. This assumption 
accounts for much of the quoted error bar. A recent global analysis of the relevant data can be found in Ref. 




(3.1) 



& 
anything more than the crudest of calculations. Apparently, $\sim 85\%$ of nucleon spin must be 
attributed to orbital and gluonic effects. To further classify the spin sources, it was proposed 
that the first moment of the spin-dependent gluon distribution contains information on the 
contribution due to gluon spin. Unfortunately, this is not the case. The subtleties associated 
with gauge invariance forbid us from separating the gluon contribution to hadronic spin into 
separate pieces representing 'orbital angular momentum' and 'helicity angular momentum'. 
These two quantities do not make sense separately for gauge fields.² 

A correct analysis of nucleon spin begins with the Pauli-Ljubanskii vector 

$$W^\mu = -\frac{1}{2}\epsilon^{\mu\nu\alpha\beta}P_\nu J_{\alpha\beta} \tag{3.2}$$

introduced in Appendix B.2. As in the appendix, we classify particle states by eigenvalues of 
the momentum operator $P$ and one of the components of the Pauli-Ljubanskii vector. In the 
rest frame of a nucleon, $p^\mu = (M, 0, 0, 0)$, the 3-component of $W$ is $MJ^{12}$. The statement that 
a proton has spin-1/2 means that there are two eigenstates of $W^3$: 

$$W^3\,|p\uparrow\downarrow\rangle = \pm\frac{1}{2}M\,|p\uparrow\downarrow\rangle \; . \tag{3.3}$$

The normalization of our states is such that 

$$\langle p', h'|p, h\rangle = 2p^0\,\delta_{hh'}\,(2\pi)^3\delta^{(3)}(\vec p - \vec p\,') \; , \tag{3.4}$$

where $h$ and $h'$ represent $\uparrow$ and $\downarrow$, so 

$$\langle p\uparrow|J^{12}|p\uparrow\rangle = \frac{1}{2}\,2M\,(2\pi)^3\delta^{(3)}(0) \; . \tag{3.5}$$

Our choice of the 3-direction was arbitrary, as was our choice of frame; in general, for an 
arbitrary spin direction $s^\mu$ and nucleon momentum $p^\mu$, we have³ 

$$\langle ps|W^\mu|ps\rangle = Ms^\mu\,2p^0\,(2\pi)^3\delta^{(3)}(0) \; . \tag{3.6}$$



2 This fact is exemplified by the discussion following Eq.(2.237). The separation of gluon spin and orbital 
angular momentum leads to this positive anomalous dimension. 

3 My normalization is such that $s^2 = -1/4$ rather than $-1$. 
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In order to understand how the spin of a proton is generated, we need to decompose it into 



several contributions. As shown in Appendix D.2, the spatial rotation generator: 



J« = / d 3 X {J% + J% + J%) (3.7) 



3% = ~^V^ (3.8) 

jV = 1/,* (x* W* - a? itf) ifi (3.9) 

J* = - (x l T ja ~ x j F a ) T\ (3.10) 

split nicely into three separately gauge-invariant pieces. Writing $S = W/M$ and substituting 
these expressions into the definition of $W$, we find that⁵ 

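$$\vec S = \int d^3x\left(\vec S_{qs} + \vec S_{qo} + \vec S_g\right) \tag{3.11}$$

$$\vec S_{qs} = \frac{1}{2}\,\overline\psi\,\vec\gamma\gamma_5\,\psi \tag{3.12}$$

$$\vec S_{qo} = \psi^\dagger\left(\vec x \times (-i\vec D)\right)\psi \tag{3.13}$$

$$\vec S_g = \vec x \times \left(\vec E \times \vec B\right) \; , \tag{3.14}$$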



where £ and B are the chromo-electric and -magnetic fields, respectively, defined by £ l = —T a% 
and B l = Equation (3.12) has the obvious interpretation as the contribution from 



quark intrinsic angular momentum since it gives a different sign to the left- and right-handed 



components of the quark fields. Similarly, (3.13) is what we expect from an orbital contribution 



to angular momentum once we recognize — iD as the momentum operator in a gauge theory. 
The remaining term is attributed to gluons. The nonlinear behavior of the gauge field under 
gauge transformations prevents any further decomposition. The canonical approach to angular 
momentum discussed in Appendix D.2 shows conclusively that this separation is unavoidably 
gauge-dependent. However, (3.14) does have a nice physical interpretation. Since $\vec E \times \vec B$ is 
the Poynting vector representing classical energy flow, this term has the expected form of an 
angular momentum. In fact, it is exactly what we would write down for the angular momentum 
of a classical electromagnetic field. 

4 As always, neglected SU(3) and flavor indices are fully contracted. 

5 As usual, a vector hat on a 4-vector denotes the 3-vector formed from its spatial components - with indices 
upstairs. 
The contribution of each operator in our decomposition is given by the matrix element 

$$\left\langle ps\left|\int d^3x\,\vec S_i(x)\right|ps\right\rangle = (2\pi)^3\delta^{(3)}(0)\,\langle ps|\vec S_i(0)|ps\rangle = 2Ma_i\,\vec s\,(2\pi)^3\delta^{(3)}(0) \; . \tag{3.15}$$



The last equality follows from rotational invariance and the fact that QCD conserves parity.⁶ 
The constants $a_i$ characterize the contribution of operator $i$ to the nucleon spin. In view of 
(3.6) and (3.11), we have the sum rule 

$$a_{qs} + a_{qo} + a_g = 1 \; . \tag{3.16}$$

At this point, we are free to use any means necessary to extract the $a_i$'s. These objects are 
inherently nonperturbative and so must be measured in experiment (at least in this stage of 
the game). 

As we saw in the last chapter, certain matrix elements show up naturally in physical scattering 
processes. We have only to calculate their coefficients and perform the measurements. 
In particular, hadronic matrix elements of the operator $\overline\psi\gamma^\mu\gamma_5\psi$ can be measured in polarized 
DIS. Exploiting Lorentz invariance,⁷ we can extract $a_{qs}$ from these measurements: 



(ps |'0T M T5'0| ps) 
(ps \ip fi^ 5 tp\ps) 



AMa qs s^ 



Aha n 



(3.17) 



where in the last line we have chosen a frame in which the hadron is moving ultra-relativistically 
along the 3-axis. In light of Equation (2.79), $a_{qs}$ is just the first moment of the polarized singlet 
quark distribution,⁸ 

a qs = [ dxf s/T (x) = AN S/T . (3.18) 



As mentioned in Section 2.3, this quantity is not conserved by the interactions of QCD. 
It is a scale-dependent object, like the QCD coupling. In fact, all three of the operators in 
Equation (3.11) evolve with scale. This is a completely natural phenomenon. Only the sum 



6 The left hand side of this equation is even (doesn't change sign) under parity, so it must be proportional 
to a vector which is also even under parity. The only such vector in this problem is $\vec s$. 

7 and the fact that QCD conserves parity 

8 The antiquark contribution must, of course, be included. As expected, it simply extends the integration 
into the negative real axis according to (2.75). 
of all three contributions must be scale-independent. This means that it makes no sense to 
ask how much spin is carried, for example, by quark orbital angular momentum. Only when 
a scale is specified do questions such as this have meaning. At high scales, one can calculate 
the evolution of these operators in perturbation theory. As the scale runs to infinity, only 
the eigenvector of the anomalous dimension matrix with the smallest (negative) eigenvalue 
becomes relevant. In this limit, one can show |34|, that the percentage of spin carried by 
quarks and gluons becomes 

$$S_q(\infty) = \frac{2N_cn_fT_F}{2(N_c^2 - 1) + 2N_cn_fT_F} = \frac{3n_f}{16 + 3n_f} \tag{3.19}$$

$$S_g(\infty) = \frac{2(N_c^2 - 1)}{2(N_c^2 - 1) + 2N_cn_fT_F} = \frac{16}{16 + 3n_f} \; . \tag{3.20}$$



This result is identical to that found in Section 2.5 for the distribution of longitudinal mo- 
mentum! Apparently, it is a somewhat general feature of the asymptotic limit. This is not 
completely surprising since its interpretation is simply benevolent sharing among partons. 
However, what this result implies about the distribution at moderate scales is wholly unclear. 

As mentioned above, experimental results indicate that the quark spin contribution to 
proton spin is $\sim 15\%$ at moderate probing scales (10 GeV$^2$). This implies that a full 85% of 
the nucleon spin is attributed to gluonic and orbital effects at these scales. Unfortunately, the 
other operators are not readily available for comment. In order to measure $a_{qo}$ directly, one 
would have to find a process which is sensitive only to the quark orbital angular momentum 
- a formidable, if not impossible, task. Luckily, all is not lost. The primordial form of $J$ 
involves the energy-momentum tensor of QCD. Hopefully, we can entice this somewhat less 
exotic operator to help us extract the information we desire. 

We have already studied a process in which matrix elements of the energy-momentum 
tensor appear - unpolarized DIS. According to (2.98) and (2.106), the second moments of 
the unpolarized nucleon structure functions are proportional to its diagonal matrix elements. 
Unfortunately, things are not as simple here as they were in the quark spin sector. The operator 
that we want diagonal matrix elements of is 

$$J^{ij} = \int d^3x\left(x^i\Theta^{0j} - x^j\Theta^{0i}\right) \; . \tag{3.21}$$
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Writing 



( J i] ) = lim ( P + — 

Ae— \ 2 



lim 
lim 

A<-'^0 

lim 

Af — >o 



d 3 xe 
d 3 x ( P 



A 



A 
~2 



d 



X l e 0j -x j e 0i 

ix-A. \ qOj _ 



2 



(3.22) 



P 



A 



(2^) 3 <5(a) 



we see that the spatial coordinates in the definition of $J$ can be converted into derivatives on 
$\Theta$'s off-diagonal matrix elements. 

Thanks to the separable form of $\Theta$, this expression for $\langle J\rangle$ can also be split into independent 
quark and gluon pieces. Off-diagonal matrix elements of these two pieces can be separately 
parameterized by form factors:⁹ 



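$$\left\langle P + \tfrac{\Delta}{2}\right|\Theta^{\mu\nu}_{q,g}\left|P - \tfrac{\Delta}{2}\right\rangle = \overline U\!\left(P + \tfrac{\Delta}{2}\right)\Big[A_{q,g}(\Delta^2)\,\gamma^{(\mu}P^{\nu)} + B_{q,g}(\Delta^2)\,P^{(\mu}i\sigma^{\nu)\alpha}\Delta_\alpha/2M + C_{q,g}(\Delta^2)\left(\Delta^\mu\Delta^\nu - g^{\mu\nu}\Delta^2\right)/M + \overline C_{q,g}(\Delta^2)\,Mg^{\mu\nu}\Big]\,U\!\left(P - \tfrac{\Delta}{2}\right) \; . \tag{3.23}$$

Here, $(\overline U)\,U$ is the free wavefunction for an (outgoing) incoming nucleon. Since we wish to 
consider a derivative of this object at $\Delta = 0$, only quantities linear in $\Delta$ can contribute. 
Lorentz invariance¹⁰ implies that the scalar $\overline UU$ can depend only on $\Delta^2$, so contributions to 
$\langle J\rangle$ can come only from the $A$ and $B$ terms. Using the Gordon identity, 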

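$$\overline U\!\left(P + \tfrac{\Delta}{2}\right)\gamma^{(\mu}P^{\nu)}\,U\!\left(P - \tfrac{\Delta}{2}\right) = \overline U\!\left(P + \tfrac{\Delta}{2}\right)\left[P^\mu P^\nu/M + P^{(\mu}i\sigma^{\nu)\alpha}\Delta_\alpha/2M\right]U\!\left(P - \tfrac{\Delta}{2}\right) \; , \tag{3.24}$$

it becomes obvious that 

$$\langle P|J^{ij}_{q,g}|P\rangle = \frac{1}{2}\left[A_{q,g}(0) + B_{q,g}(0)\right]\overline U(P)\,\sigma^{ij}\,U(P)\,(2\pi)^3\delta^{(3)}(0) \tag{3.25}$$
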

9 Conservation of the full energy-momentum tensor implies that $\overline C_q = -\overline C_g$. 
10 along with the fact that $\Delta\cdot P = 0$ since both external states are on the same mass-shell 
in the rest frame of the nucleon. Contracting with $\epsilon^{ijk}/2$ and comparing with (3J3) in the rest 
frame, we see that the quark and gluon contributions to hadronic spin are given by¹¹ 

$$S_{q,g} = A_{q,g}(0) + B_{q,g}(0) \; ; \qquad \sum_{q,g}\left[A_{q,g}(0) + B_{q,g}(0)\right] = 1 \; . \tag{3.26}$$

This is an extremely interesting result. It states that we can obtain information about the 
spin structure of a hadron from unpolarized scattering experiments. This is not a completely 
crazy idea since polarization concerns the alignment of a particle's spin rather than the way 
it is distributed among its constituents. Still, it is somewhat unexpected that we can obtain 
such useful information on spin without polarization. 

Although this result still does not tell us how to measure the quark orbital contribution 
to proton spin directly, knowledge of the above quantities coupled with the information ob- 
tained from polarized DIS allows us to extract this information. At present, we have very little 
information on the nonperturbative momentum distribution of partons within hadrons. The 
distributions measured in DIS give us information only about longitudinal momentum distri- 
butions. The quark orbital contribution to the proton spin will give us valuable information on 
the transverse momentum distribution. In addition, since the proton represents the absolute 
baryonic ground state, information on its orbital distribution tells us information about the 
structure of this global energy minimum of QCD. 



However simple it seems, Equation (3.26) does have a catch. As mentioned above, DIS 



allows us to probe diagonal matrix elements of $\Theta$. According to (3.23), the form factor $A(0)$ 
can be extracted from these measurements. Unfortunately, the other form factor necessary for 
our present purpose does not contribute to diagonal matrix elements. In order to extract $B(0)$, 
we will have to find some way to probe slightly off-diagonal matrix elements of this operator. 



We discovered in Section 2.5 that the local operators appearing in the QCD stress-energy 
tensor also contribute to the light-cone operator product expansion of two electromagnetic 
currents. Although this expansion is valid only for diagonal matrix elements, it is certainly 
plausible to assume that the operators of interest will also contribute to a generalized expansion 
which extends the range of validity. If this is the case, then we can extract information on 
the off-diagonal matrix elements of $\Theta$ by studying off-forward two-photon processes. In fact, 

11 I have used the fact that $\overline U\vec\gamma\gamma_5U = 4M\vec s$ in the rest frame. 
it is easy to show that all of Chapter 2 generalizes quite easily to off-forward processes in 
which both photons are deeply virtual (though with different virtualities). The calculations 
are almost identical. However, there is a major philosophical difference between these two 
processes. Deep-inelastic scattering only becomes a two-photon process when we sum over 
final states and use the optical theorem to express its cross-section as the imaginary part 
of the Compton amplitude. Experimentally, it is electron-proton scattering with only stray 
photons in the initial or final states. On the other hand, the general two-photon processes 
proposed to help us extract $B(0)$ are intrinsic Compton processes. Since the initial and final 
states are necessarily different, the imaginary parts of these amplitudes are not related to 
any cross-section. To get the cross-section for a physical process, we have to square these 
amplitudes! Furthermore, in order for these processes to be physical, the initial and final 
states must be onshell. In DIS, the virtual photon is not a part of the initial or final state; 
it is exchanged between the electron and the proton during the process. We can generate the 
initial virtual photon the same way in the present case, but the deeply virtual photon in the 
'final' state presents a real problem. It is simply too much to ask for the electron to bear the 
burden of both virtual photons. Unfortunately, spacelike virtual photons are in no position to 
bargain. If they are not quickly absorbed by other particles, they simply will not form in the 
first place. 

One way out of this dilemma is to move into a region where the final photon is not spacelike, 
but timelike. Timelike virtual photons have many options open to them. Depending on their 
energy, they can decay into many physical final states. However, the kinematics associated 
with timelike virtual photons requires a large momentum transfer between the initial and final 
states. Since we intend to extrapolate our form factors to zero momentum transfer, this is not 
ideal for our purpose. 

Alternatively, we can relax the condition that both photons have a large virtuality and allow 
the final state photon to go onshell. This process, called deeply virtual Compton scattering 
(DVCS), is the subject of the present chapter. We will find that it is almost ideal for our 
needs in that it probes certain nonperturbative structure functions, the off-forward parton 
distributions (OFPD's), whose moments are related to off-diagonal matrix elements of the 
stress-energy tensor. Furthermore, its kinematics permit the square of the momentum transfer 
to vanish with the nucleon mass. This fact allows a smooth extrapolation to our region of 
interest. 



3.2 Kinematics and Parton distributions 

Although our ultimate interest is in deeply virtual Compton scattering, we start by considering 
a general Compton process involving two offshell photons with different virtualities. This and 
a suitable choice of kinematic variables allow us to exploit the full symmetry of the problem. In 
the general Compton process, a virtual photon of momentum $q + \Delta/2$ is absorbed by a hadron 
of momentum $P - \Delta/2$, which then emits a virtual photon with momentum $q - \Delta/2$ and 
recoils with momentum $P + \Delta/2$. The three independent external momenta can be expanded 
in terms of the light-cone vectors 

$$p^\mu = (p^+, 0, 0, p^+) , \qquad n^\mu = \frac{1}{2p^+}(1, 0, 0, -1) , \tag{3.27}$$

where the 3-direction is chosen as the direction of the average hadron momentum ($P$), and 
two transverse vectors. As in previous chapters, we call the coefficient of $p^\mu$ the $+$ component 
and that of $n^\mu$ the $-$ component. Thus we write 

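$$\begin{aligned}
P^\mu &= p^\mu + \frac{M^2 - t/4}{2}\, n^\mu , \\
q^\mu &= -\zeta p^\mu + \frac{Q^2}{2\zeta}\, n^\mu , \\
\Delta^\mu &= -2\xi p^\mu + \xi (M^2 - t/4)\, n^\mu + \Delta_\perp^\mu ,
\end{aligned} \tag{3.28}$$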

where $M$ is the hadron mass (which is taken to be the same for the initial and final hadrons), 
$t = \Delta^2$, $Q^2$ is the virtuality of $q^\mu$, $\xi$ is a measure of the difference of the virtualities of the two 
external photons, $\zeta$ is defined as 

$$\zeta = \frac{Q^2}{2x_B(M^2 - t/4)}\left[-1 + \sqrt{1 + \frac{4x_B^2(M^2 - t/4)}{Q^2}}\,\right] , \tag{3.29}$$

and $\Delta_\perp^\mu$ is a vector in the transverse directions which has squared length $-t(1 - \xi^2) - 4\xi^2M^2$. 
We have also introduced $x_B = Q^2/(2P\cdot q)$, the analogue of the Bjorken scaling variable in 
this off-forward process. These expressions limit the range of $\xi$ to 

$$\xi^2 < \frac{-t}{4M^2 - t} \tag{3.30}$$

for fixed $t$, or the range of $t$ to 

$$-t > \frac{4\xi^2M^2}{1 - \xi^2} \tag{3.31}$$

for fixed $\xi$. 
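
The kinematic relations of this section lend themselves to a quick numerical cross-check. The following is a minimal sketch in Python, assuming that $\zeta$ is the positive root implied by $x_B = Q^2/(2P\cdot q)$ with $p\cdot n = 1$, and that the squared length of $\Delta_\perp$ is $-t(1-\xi^2) - 4\xi^2M^2$; the function names and sample numbers are illustrative choices, not taken from the text.

```python
import math

def zeta(Q2, xB, M2, t):
    """Coefficient zeta: positive root of zeta^2 m2 + zeta Q2/xB - Q2 = 0."""
    m2 = M2 - t / 4.0
    return Q2 / (2.0 * xB * m2) * (-1.0 + math.sqrt(1.0 + 4.0 * xB**2 * m2 / Q2))

def two_P_dot_q(Q2, xB, M2, t):
    """2 P.q from P = p + (m2/2) n and q = -zeta p + Q2/(2 zeta) n, with p.n = 1."""
    m2 = M2 - t / 4.0
    z = zeta(Q2, xB, M2, t)
    return Q2 / z - z * m2

def delta_perp_sq(xi, M2, t):
    """Squared length of the transverse part of Delta."""
    return -t * (1.0 - xi**2) - 4.0 * xi**2 * M2

if __name__ == "__main__":
    Q2, xB, M2, t = 10.0, 0.3, 0.88, -0.2   # GeV^2 where dimensionful (sample values)
    # Should reproduce Q2 / xB, i.e. the definition of the scaling variable.
    print(two_P_dot_q(Q2, xB, M2, t), Q2 / xB)
    # In the Bjorken limit zeta approaches xB.
    print(zeta(1.0e6, xB, M2, t))
    # At the edge of the allowed region the transverse length vanishes.
    xi = 0.1
    t_edge = -4.0 * xi**2 * M2 / (1.0 - xi**2)
    print(delta_perp_sq(xi, M2, t_edge))
```

Requiring `delta_perp_sq` to be non-negative is what restricts $\xi$ at fixed $t$, or $t$ at fixed $\xi$.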

According to the last section, our region of interest is $t = 0$. In order for the final state 
photon to be onshell,^12 we must have $\xi \neq 0$. In light of the above expressions, this is only 
possible if we take the limit $M^2 \to 0$ before $t \to 0$. Unfortunately,^13 we do not control the 
mass of the nucleon. However, it is kinematically allowed to take $\xi$ small so that $-t$ can 
be much smaller than $M^2$. Performing several of these measurements will provide a smooth 
extrapolation to the required values. 

In the Bjorken limit, these expressions simplify considerably. Since we consider only the 
leading twist in this chapter, we may neglect all but the $+$ components of $P^\mu$ and $\Delta^\mu$ (in order 
to form large scalars, one must dot the $+$ component of a vector with the $-$ component of $q$). 
Hence, in the limit $Q^2 \to \infty$ ($x_B$ remaining finite), we may write 

$$P^\mu \approx p^\mu , \qquad q^\mu \approx -x_B\, p^\mu + \frac{Q^2}{2x_B}\, n^\mu , \qquad \Delta^\mu \approx -2\xi p^\mu . \tag{3.32}$$

We note that the external invariants have been reduced from six to three by enforcing 
kinematics and taking the Bjorken limit. We express these three scalars in terms of one mass scale, 
$Q^2$, and two dimensionless parameters, $x_B$ and $\xi$. Apparently, the limit $\xi \to 0$ recovers the 
DIS results from the last chapter. We will find this to be a useful check of our calculation. 
The DVCS limit is given by $\xi \to x_B$. As mentioned above, we cannot do DVCS when $\xi = 0$.^14 
However, measurements at small $x_B$ may allow $-t$ small enough that the variation to zero 
holds no surprises. In particular, for the relatively large value $x_B \sim 0.1$, we can have $-t$ as 
low as $\sim 0.04$ GeV$^2$. Since typical hadronic scales are $\sim 0.1$ GeV$^2$ and larger, this value of $-t$ 
is actually quite close to zero. 
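
The quoted estimate can be reproduced with a few lines of Python. This is a minimal sketch, assuming the lower bound $-t > 4\xi^2M^2/(1-\xi^2)$ evaluated at $\xi = x_B$ (the DVCS limit) and a proton mass of about 0.938 GeV; the function name is hypothetical.

```python
def min_minus_t(xi, M=0.938):
    """Smallest kinematically allowed -t (in GeV^2) for a given xi.

    Assumes the bound -t > 4 xi^2 M^2 / (1 - xi^2); M is the nucleon
    mass in GeV (the proton mass is used as a default).
    """
    return 4.0 * xi**2 * M**2 / (1.0 - xi**2)

if __name__ == "__main__":
    for xB in (0.05, 0.1, 0.2):
        # In the DVCS limit xi -> x_B, so x_B sets the floor on -t.
        print(f"x_B = {xB:4.2f}  ->  -t_min = {min_minus_t(xB):.4f} GeV^2")
```

For $x_B = 0.1$ this gives a floor of roughly 0.036 GeV$^2$, in line with the value of about 0.04 GeV$^2$ quoted above.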

12 in the Bjorken limit 
13 Well, maybe fortunately... 

14 Otherwise, we could've just extracted the required information from DIS measurements! 
Our goal is to factorize the short and long distance physics of the Compton amplitude 

$$T^{\mu\nu} = i\int d^4z\, e^{iq\cdot z}\left\langle P + \frac{\Delta}{2}\right| \mathrm{T}\, J^\mu\!\left(-\frac{z}{2}\right) J^\nu\!\left(\frac{z}{2}\right)\left| P - \frac{\Delta}{2}\right\rangle \tag{3.33}$$

in the Bjorken limit, where $J^\mu = \sum_q e_q\bar\psi_q\gamma^\mu\psi_q$ is the electromagnetic current and $\psi_q$ is the 
bare quark field of flavor $q$ and charge $e_q$. Note that the operator here is identical to that 
considered in DIS. Only the external states are different. This immediately implies that all of 
the diagrams we need to calculate are topologically identical to those of the last chapter. The 
only difference is in the momenta of the external states. 



As before, the nonperturbative contribution to the Compton amplitude in Eq.(3.33) can be 
expressed in terms of the off-forward parton distributions contained in parton density matrices 
fl6"c| . For quarks, we define the light-cone correlation function 

A 



P 



1 



(3.34) 



where h is the helicity of the proton. The ellipses denote contributions either of higher twist 
or chiral-odd structure, which do not contribute to the leading process under consideration. 



The gauge link, $G_n$, is defined as in Section 2.3 with a contour that runs along the $n$ direction. 
Note that this density matrix is just an off-diagonal matrix element of the same field operator 
we considered there.^16 

Isolating the scalar distributions, we find 



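$$F_{q/T}(x,\xi,t) = \frac{1}{2}\int\frac{d\lambda}{2\pi}\, e^{-i\lambda x}\left\langle P + \frac{\Delta}{2}\right| \bar\psi_q\!\left(\frac{\lambda}{2}n\right) G_n\!\left(\frac{\lambda}{2}n, -\frac{\lambda}{2}n\right) \slashed{n}\, \psi_q\!\left(-\frac{\lambda}{2}n\right)\left| P - \frac{\Delta}{2}\right\rangle , \tag{3.35}$$

$$\tilde F_{q/T}(x,\xi,t) = \frac{1}{4h}\int\frac{d\lambda}{2\pi}\, e^{-i\lambda x}\left\langle P + \frac{\Delta}{2}\right| \bar\psi_q\!\left(\frac{\lambda}{2}n\right) G_n\!\left(\frac{\lambda}{2}n, -\frac{\lambda}{2}n\right) \slashed{n}\gamma_5\, \psi_q\!\left(-\frac{\lambda}{2}n\right)\left| P - \frac{\Delta}{2}\right\rangle . \tag{3.36}$$
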


15 Here, we are actually interested in this amplitude rather than the analogue of $W^{\mu\nu}$. 

16 The difference in definition between this correlation function and that in Section 2.3 (the arguments of 
the parton fields) is not important there since the matrix element is diagonal. However, here the off-diagonal 
nature of the matrix element creates a dependence on the positions of the fields. I have chosen the symmetric 
form for later convenience. 
The ever-present renormalization scale, $\mu$, has been suppressed here for brevity. Dependence 
on the square of the momentum transfer, $t$, will also largely be ignored in the following as it 
will not affect most of our discussion. 

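Before we go on, let us see how these distributions are related to the form factors $A(t)$ and 
$B(t)$ of the stress-energy tensor. Writing 

$$\int\frac{d\lambda}{2\pi}\, e^{-i\lambda x}\left\langle P + \frac{\Delta}{2}\right| \bar\psi_q\!\left(\frac{\lambda}{2}n\right) G_n\, \gamma^\mu\, \psi_q\!\left(-\frac{\lambda}{2}n\right)\left| P - \frac{\Delta}{2}\right\rangle = \bar U\!\left(P + \frac{\Delta}{2}\right)\left[\gamma^\mu H_{q/T}(x,\xi,t) + \frac{i\sigma^{\mu\alpha}\Delta_\alpha}{2M} E_{q/T}(x,\xi,t)\right] U\!\left(P - \frac{\Delta}{2}\right) , \tag{3.37}$$

we see that 

$$F_{q/T}(x,\xi,t) = \frac{1}{2}\,\bar U\!\left(P + \frac{\Delta}{2}\right)\left[\slashed{n}\, H_{q/T}(x,\xi,t) + \frac{i\sigma^{\mu\alpha}n_\mu\Delta_\alpha}{2M} E_{q/T}(x,\xi,t)\right] U\!\left(P - \frac{\Delta}{2}\right) . \tag{3.38}$$
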



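On the other hand, the second $x$-moment of $F_{q/T}(x,\xi,t)$ is an off-diagonal matrix element of 
the quark contribution to the stress-energy tensor 

$$\int dx\, x\, F_{q/T}(x,\xi,t) = \frac{1}{2}\left\langle P + \frac{\Delta}{2}\right| \bar\psi_q\, \slashed{n}\; n\cdot i\overleftrightarrow{D}\, \psi_q \left| P - \frac{\Delta}{2}\right\rangle . \tag{3.39}$$

Comparison with (3.23) gives 

$$\int dx\, x\, F_{q/T}(x,\xi,t) = \frac{1}{2}\,\bar U\!\left(P + \frac{\Delta}{2}\right)\left[A_q(t)\,\slashed{n} + \frac{i\sigma^{\mu\alpha}n_\mu\Delta_\alpha}{2M} B_q(t) + \frac{4\xi^2}{M} C_q(t)\right] U\!\left(P - \frac{\Delta}{2}\right) . \tag{3.40}$$

The Gordon identity, Equation (3.24), allows us to deduce 

$$\int dx\, x\, H_{q/T}(x,\xi,t) = A_q(t) + 4\xi^2 C_q(t) , \tag{3.41}$$

$$\int dx\, x\, E_{q/T}(x,\xi,t) = B_q(t) - 4\xi^2 C_q(t) , \tag{3.42}$$

from which we immediately see that 

$$S_q = A_q(0) + B_q(0) = \int dx\, x \left[H_{q/T}(x,\xi,0) + E_{q/T}(x,\xi,0)\right] . \tag{3.43}$$
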



17 Note that here the '<->' adorning T> is quite important. For off-diagonal matrix elements, it must not be 
forgotten that S' 1 " is a benevolent operator which treats both external states with the utmost care and respect. 
Luckily, the quark contribution to angular momentum is not sensitive to $\xi$. The lower bound 
on $-t$ causes us to want $\xi$ small anyway, but it's certainly nice to know that we need not 
perform an extrapolation to $\xi = 0$; only $t = 0$ is strictly required. 
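
The $\xi$-independence of the sum can be made concrete with a small numerical sketch in Python. It assumes the moment relations $\int dx\,x\,H = A + 4\xi^2C$ and $\int dx\,x\,E = B - 4\xi^2C$; the form-factor values below are made-up placeholders chosen only for illustration, not physical inputs.

```python
def second_moments(A, B, C, xi):
    """Second x-moments of H and E for given form factors A, B, C at fixed t."""
    h2 = A + 4.0 * xi**2 * C
    e2 = B - 4.0 * xi**2 * C
    return h2, e2

def spin_sum(A, B, C, xi):
    """Sum of the two moments; the C terms cancel, leaving A + B."""
    h2, e2 = second_moments(A, B, C, xi)
    return h2 + e2

if __name__ == "__main__":
    A, B, C = 0.5, 0.1, -0.2          # placeholder values, not measurements
    for xi in (0.0, 0.2, 0.4):
        h2, e2 = second_moments(A, B, C, xi)
        # Each moment shifts with xi, but their sum does not.
        print(xi, h2, e2, spin_sum(A, B, C, xi))
```

Each moment separately depends on $\xi$ through $C$, which is why only the combination $H + E$ can be evaluated at any convenient $\xi$.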

Many properties of the distributions $E(x,\xi,t)$ and $H(x,\xi,t)$, as well as their sister 
distributions in the polarized, chiral-odd, and gluon sector are summed up nicely in Ref. [78]. 
There, one can find details of their parton model interpretation, form factor decompositions 
for their moments, models representing their behavior, and sum rules that constrain them. 

The contribution of gluons to DVCS requires off-forward distribution functions. In analogy 
with Section 2.3, these are written^18 



(3.44) 



1 



dX 



= ~ I ^e~ iXx (P+^ 



2x J 2vr 

i_ r dX 

4hx / 2tt' 



-iXx 



A 



b ot 



-2 U 



2 



P- 



(3.45) 



In this chapter, the fact that the external wavefunctions do not correspond to the same 
momenta implies that the identity $u\bar u = (1 + 2h\gamma_5)\slashed{p}/2$ cannot be used. Instead, we calculate 
the amplitudes via a direct substitution of the expansions of the parton distributions. 
Interpreting these functions as their leading order values (which are simple $\delta$-functions for partonic 
targets), we will see that the one-loop calculation generates their evolution. For example, the 
polarized quark amplitude is obtained by tracing with $\gamma_5\slashed{p}$ because this is the structure that 
appears multiplying $\tilde F_{q/T}$ in (3.34). The subtleties associated with defining $\gamma_5$ in $d$ dimensions 
make the polarized result ambiguous. To resolve the ambiguity, one must use a well-defined 
$\gamma_5$ scheme. Several different schemes can be found in the literature;^19 the prescription used 
here is equivalent to that introduced by 't Hooft and Veltman |lC| ]. The fact that this choice 
favors certain directions over others makes it necessary to multiply the polarized contribution 
of diagram 3.1e by the factor $(1 + \delta\epsilon)$,^20 where $\delta$ is determined by the requirement that the 
nonsinglet axial current is conserved, whenever one wishes to consider the chiral properties of 
18 Here, A leads to an additional twist-2 term involving gluon helicity flip. This term is considered in 

19 See Appendix 

20 This amounts to a redefinition of $\gamma^\alpha\gamma_5\gamma_\alpha$. 
a theory [79]. Since we are not concerned with the chiral nature of QCD at the moment, we 
will ignore this technicality. 

In the gluon sector, the fact that the conceptual idea of removing one gluon and the actual 
gauge invariant distributions that describe this process do not coincide leads to extra factors. 
In this case, one can show that 



—e~ lXx ( P + = 
2ir 



A 



A 



A 



2 



X-i-X _ 



d-2 



g^F g/T (x,0 - ie^(2hp) a n F g/T (x,O 



,(3.46) 



where I have defined $x_\pm = x \pm \xi$. Here, we see explicitly the extra factor of $1/(1 - \epsilon/2)$ 
multiplying the unpolarized gluon amplitude. 



3.3 One-Loop Compton Amplitudes 
on Quark and Gluon Targets 

In this section, we perform a one-loop calculation of the general Compton scattering on onshell 
quark and gluon targets in the Bjorken limit. The result will be used in the next section to 
show that the infrared divergences present in our amplitude are generated exclusively by the 
off-forward parton distributions. This property does not change in the limit of a real final 
state photon. 

We begin with an onshell quark target of momentum $p$. Here, there are two diagrams at 
leading order (LO) and eight at next-to-leading order (NLO). The renormalization constants 
necessary for this process cancel themselves out in exactly the same manner we saw in Section 
2.2. Half of the diagrams are shown in Fig. 3.1. The other half will be taken into account 
by using the crossing symmetry, i.e., the simultaneous replacement of $q \to -q$ and $\mu \leftrightarrow \nu$. 
The terms $\pm(x_B \to -x_B)$ in the following formulae reflect this contribution. Because of time 
reversal invariance, the Compton amplitude is also an even function of $\xi$. This relates the 
left and right vertex diagrams (Figs. 3.1c and 3.1d, respectively) to each other. The quark 
self-energy diagram and the box diagram are themselves $\xi$-symmetric. This symmetry not 
only allows us to reduce the number of graphs at NLO from four to three, but also becomes a 
powerful tool which helps us compute each amplitude. 
Figure 3.1: Diagrams for Compton scattering on an onshell quark to order $\alpha_s$. 

Figure 3.2: The momentum flow in the box diagram. 

Examining our graphs, we see that the self-energy diagram contains a loop integral with two 
Feynman denominators, the vertex diagram contains one with three, and the box with four. 
A one-loop integral with two propagators is straightforward. Difficulties arise, however, with 
the calculation of three- and especially four-propagator integrals. These difficulties may be 
avoided in this calculation because of several simplifications. Consider first the box diagram. 
The loop integral is of the form (with momentum routing as shown in Fig. 3.2) 

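$$\int\frac{d^dk}{(2\pi)^d}\,\frac{\mathrm{Tr}\left[\gamma^\alpha(\slashed{k} - \xi\slashed{p})\gamma^\mu(\slashed{k} + \slashed{q})\gamma^\nu(\slashed{k} + \xi\slashed{p})\gamma_\alpha\slashed{p}\,(\gamma_5)\right]}{(k + \xi p)^2\,(k - \xi p)^2\,(k - p)^2\,(k + q)^2} , \tag{3.47}$$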

where I have replaced $\Delta^\mu$ by $-2\xi p^\mu$. In order to simplify the integral, we express the trace as 
a sum of terms which cancel one of the propagators. This can be done because both $k^2$ and 
$2p\cdot k$ can be written as linear combinations of $(k + \xi p)^2$ and $(k - \xi p)^2$, and the trace vanishes 
whenever $k^-$ and $k_\perp$ do. 
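
The linear relations behind this cancelation are easy to verify numerically. The sketch below (Python) assumes a lightlike $p$, so that $(k \pm \xi p)^2 = k^2 \pm 2\xi\, p\cdot k$, which gives $k^2 = \tfrac{1}{2}[(k+\xi p)^2 + (k-\xi p)^2]$ and $2p\cdot k = [(k+\xi p)^2 - (k-\xi p)^2]/(2\xi)$. The helper names and the sample vectors are hypothetical.

```python
import random

def mdot(a, b):
    """Minkowski product with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def shifted(k, p, s):
    """Return the four-vector k + s * p."""
    return tuple(ki + s * pi for ki, pi in zip(k, p))

def cancellation_residuals(k, p, xi):
    """Residuals of the two linear relations; both vanish when p^2 = 0."""
    d_plus = mdot(shifted(k, p, xi), shifted(k, p, xi))      # (k + xi p)^2
    d_minus = mdot(shifted(k, p, -xi), shifted(k, p, -xi))   # (k - xi p)^2
    res_k2 = 0.5 * (d_plus + d_minus) - mdot(k, k)
    res_pk = (d_plus - d_minus) / (2.0 * xi) - 2.0 * mdot(p, k)
    return res_k2, res_pk

if __name__ == "__main__":
    p = (1.0, 0.0, 0.0, 1.0)          # lightlike: p^2 = 0
    random.seed(0)
    k = tuple(random.uniform(-1.0, 1.0) for _ in range(4))
    # Both residuals should be zero up to floating-point rounding.
    print(cancellation_residuals(k, p, xi=0.3))
```

Because any numerator that vanishes with $k^2$ and $2p\cdot k$ is a combination of these two denominators, each term removes one propagator from the box.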

Now that we have shown that a denominator can be canceled, we have effectively reduced 
the four-propagator problem to a three-propagator one. Since we have only shown that the 
numerator will be a linear combination of two different denominators, rather than proportional 
to one, the four-propagator integral will in general become two three-propagator integrals. 
However, we may use the $\xi$ symmetry by writing 

1 



(3.48) 



In this way, we consider only the $(k + \xi p)^2$ cancelation and let the symmetry take care of 
the rest. The integral we now have is exactly the same (up to numerator differences) as that 
arising from the right vertex correction. We will see that this basic integral is the only one 
we must calculate to obtain both the polarized and unpolarized amplitudes for the quark and 
gluon contributions (if we forget, for the moment, the simple self-energy diagram). 
The integral has the form 

$$\int\frac{d^dk}{(2\pi)^d}\,\frac{\text{Numerator}}{\left[(k - p)^2 + i\varepsilon\right]\left[(k - \xi p)^2 + i\varepsilon\right]\left[(k + q)^2 + i\varepsilon\right]} . \tag{3.49}$$

We have found that this integral is easily done in light-cone coordinates by expanding $k^\mu$ in 
terms of the light-cone vectors $p^\mu$ and $n^\mu$. As in Section 2.2, we first do the $k^-$ integration 
by contour in the unphysical region of large $x_B$ and then the transverse integrations. The $k^+$ 
integration is left until the end. Writing $k^+ = yp^+$, the value of the integral (3.49) is 
'Q 2 V e/2 r(e/2) f (xB-a^- &12 



16tt 2 V 4tt 



1 - £ IV X B 



dy{^-\ (I 

\x B ~ a 



y-g 

xb — a 



e/2 



N(a) 



(3.50) 



where N(a) is defined by 



Numerator 
N(a) = 



k-=k 2 ± /2p+(y-a) 

a xb(xb — a) 



a + 0k 



2 . 



(3.51) 



■J 2 (y - a){x B - y) 

Figure 3.3: Diagrams for gluon Compton scattering at one-loop. 

Doing the $y$-integrals requires some care because a delicate cancelation must occur if one is to 
get a finite result, but the treatment is straightforward. After the $\xi$ and crossing symmetries 
are used, we find the full NLO result for the symmetric quark amplitude^21 
(3.52) 
(3.53) 



log ( 1 - — 

X B 



(Xi 



x B - 1. 

The divergences in these amplitudes are infrared divergences since renormalization has removed 
all ultraviolet singularities. Their presence signals the existence of nonperturbative physics in 
the process. As mentioned earlier, they will be factorized into nonperturbative matrix elements 
whose values can be extracted from experiment. We will see this explicitly in the next section. 
For now, we summarize the contributions from the gluon sector. 

The six graphs that contribute to the LO amplitude for gluon-photon scattering can be 
reduced to three by reversing the fermion number flow, and one of these can be eliminated 



21 We take $\mu = Q$ and $\mu\nu$ in the $\perp$ directions as before. 
by crossing symmetry. The two distinct graphs we are left with are shown in Fig. 3.3. The 
denominator of Fig. 3.3a is identical to that of the quark box. Again, the numerator is seen 
to vanish whenever $k^2$ and $2p\cdot k$ do, which allows us to cancel one of the propagators exactly 
as above. Fig. 3.3b is somewhat more tricky. This diagram is itself symmetric under both 
crossing and $\xi$ symmetry. Labeling the momenta as shown, we see that under an integral the 
symmetry $q \to -q$ is equivalent to $A \leftrightarrow B$ and $C \leftrightarrow D$, and $\xi \to -\xi$ is equivalent to $B \leftrightarrow D$ 
and $A \leftrightarrow C$. We also note that here it is possible to represent 1 as a linear combination of 
Feynman denominators, which guarantees our ability to cancel one. Of course, since 1 does 
not depend on $q$ or $\xi$, it may be represented in the symmetric form 

1 



1 = -- 



+ (£ ^ -£) + (<? ^ -g). 



(3.54) 



2(1 -Of « 

Now it remains only to calculate the trace in a symmetric way and substitute the result into 
the formulae of the quark calculation. Averaging the gluon polarization for the symmetric 
amplitude in d dimensions, one finds 
(3.55) 

$+\,(x_B \to -x_B)$ 

for the symmetric amplitude and 
(3.56) 

$-\,(x_B \to -x_B)$ , 



where ?y = ±1 is the helicity of the gluon 'target', for the antisymmetric amplitude. 
3.4 One-Loop Factorization in DVCS 



We now turn to the infrared divergences present in all of the amplitudes. These divergences 
arise from regions of the loop-momentum integration where some of the internal propagators 
are near their 'mass shells'. In these regions, perturbative calculations are clearly meaningless. 
The standard procedure of fixing this problem is to factorize the amplitudes into their infrared 
safe (i.e. devoid of infrared divergences) and infrared divergent pieces, interpreting the latter 
as nonperturbative QCD quantities. The goal of this section is to show that all infrared 
divergences present in the Compton amplitudes can be associated with the off-forward parton 
distributions. 

Since we consider factorization within the framework of perturbation theory, we need to 
compute the parton distributions in quark and gluon targets in perturbative QCD. At the 
leading order in $\alpha_s$, one has 



(3.57) 



At next-to-leading order, 



where 



2tt 
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^y-x-ie 



(3.59) 



can be determined from conditions analogous to (2.13C) and 

x 2 + 1 - 2e 



P qq {x,i) = C F 
= C F 



(i - x + ie){\ - e) ' 

x + £ / 2£ 
(3.60) 



is calculated in the same way as P q ^ q (x) in Section |2.3| . The quark distributions contain 
infrared divergences signaled by the presence of the 1/e terms. These divergences reflect the 
soft physics intrinsic to the parton distributions. F^',, calculated for the quark target, satisfies 
the evolution equation derived in Ref. [p8| : 



2 DgFa/ T (x,£,n 2 ) ajjj?) f dy (x A 2 
D^J. 2tt J x y \y y 1 



(3.61) 
where $b$ is summed over all parton species and the $P$'s are the off-forward Altarelli-Parisi 
kernels, or splitting functions. The 'covariant' derivative is defined to include $A_a$, the endpoint 
contribution in Eq.(3 

Note that these kernels reduce to the standard forward splitting 
function (2.142) in the DIS limit $\xi \to 0$. 

We can re-express the symmetric part of the quark Compton amplitude in terms of the 
unpolarized, off-forward quark distribution^22 
and find that the infrared physics is completely contained within $F_{q'/q}(x,\xi)$. Analogously, we 
find that the antisymmetric part of the quark Compton amplitude can be expressed in terms 
of the polarized off-forward quark distribution 



■ie a ^n a {2hp)pY, e l j dxF q , /q (x,0^^z 
We now turn to Compton scattering on a gluon target. Infrared divergent contributions 
come from regions where the quarks in the box diagrams are nearly onshell. Therefore, it is 



22 For convenience, we have identified the factorization scale $\mu$ with $Q$. 
natural to associate these divergences with the quark distributions in a gluon target. At order 
$\alpha_s$, off-forward gluonic matrix elements of the quark distribution operators have the value 



where for x > £ 



= ^ (~- e +log(M 2 e^/4^)) P qg M , 

- 2 <-{i~ X ) 2 -e 



P qg {x,£) = 2T F 
(3.64) 



(3.65) 



(3.66) 



F$l(x,0 =0 for x<-e 3 

On the other hand, the finite contributions come from regions where large momenta run 
through the quark loop. In these regions, the photon has an effective pointlike coupling with 
the gluons in the target. At leading order, the off-forward gluon distribution in a gluon target 
is just 

$$F^{(0)}_{g/g}(x,\xi) = \frac{1}{2}\left(\delta(x - 1) - \delta(x + 1)\right) . \tag{3.67}$$

Using the above off-forward distributions, we re-express the symmetric part of the gluon 
Compton scattering amplitude 
Similarly, we can re-express the antisymmetric part of the gluon Compton amplitude in terms 
of helicity-dependent, off-forward quark and gluon distributions 

9 



23 This accounts for the factor of 2 in Eq.(3.65). The antiquark contribution is treated with the quarks for 
simplicity. 
The complete set of off-forward splitting functions at leading order can be found in Ref . j6^] . 

The one-loop Compton amplitude on a target $T$ may be summarized by the factorization 
formula^24 



(3.70) 



1 dx 
l x 



\Xb Xb J \X B Xb , 

where $C_q$, $C_g$, $\tilde C_q$ and $\tilde C_g$ are shown to order $\alpha_s$ in Eqs. (3.63), (3.69), respectively.^25 

In the above form, all infrared sensitive contributions have been isolated in the relevant parton 
distributions, which must be calculated nonperturbatively or measured in experiments. In the 
DVCS limit $\xi \to x_B$, the coefficient functions remain finite, although they have branch cuts 
there. This indicates that factorization holds for two-photon amplitudes even when one of the 
photons is onshell. We will argue in the next section that the above formula, one of the main 
results of this thesis, remains valid to all orders in perturbation theory. 



3.5 All Order Factorization of DVCS 

In this section, we generalize the one-loop result of the previous section, showing that the 
factorization formula Eq.(3.70) is valid in the DVCS limit to all orders in perturbation theory. 



24 This form is correct for spin-1/2 targets. For spin-1, the replacement 2h — > A is required. This convention 
is fixed by the normalization of the distributions. 

25 I emphasize here that the contributions from longitudinally polarized photons and from photon helicity 
flip have been neglected. Both effects start at order $\alpha_s$, although longitudinal contributions are kinematically 
suppressed. 
The one-loop result indicates that all soft divergences - those associated with integration 
regions where all components of some internal momenta are zero - cancel, whereas all collinear 
divergences can be factorized into the off-forward parton distributions. To see that this 
happens also at higher orders in perturbation theory, it is important to understand how the soft 
cancelation happens in the simplest case. 

The self-energy diagram in Fig. 3.1b does not contain any infrared divergences because 
the intermediate quark is far offshell. The vertex corrections in Figs. 3.1c and 3.1d potentially 
have infrared divergences, but a simple power counting argument indicates that these diagrams 
are in fact infrared convergent. Thus infrared divergences appear only in the box diagram. In 
the region where the gluon momentum $k$ is soft, we can approximate the integral as 

$$\int\frac{d^4k}{(2\pi)^4}\; p\cdot p'\,\frac{1}{p'\cdot k}\,\frac{1}{p\cdot k}\,\frac{1}{k^2} , \tag{3.71}$$

where $p$ and $p'$ are the momenta of the two external quark lines. On the other hand, the 
wavefunction renormalization of an 'onshell' quark, $\delta Z$, also contains infrared divergences. 
Grouping these divergent terms together, we have the entire soft contribution 

$$-\frac{1}{2}\int\frac{d^4k}{(2\pi)^4}\left(\frac{p^\mu}{p\cdot k} - \frac{p'^\mu}{p'\cdot k}\right)^2\frac{1}{k^2} . \tag{3.72}$$


In the collinear approximation, p' is proportional to p and thus the above integral vanishes. 
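
The vanishing of the soft contribution can be illustrated numerically. The sketch below (Python) assumes that the soft integrand is built from the eikonal combination $p^\mu/(p\cdot k) - p'^\mu/(p'\cdot k)$ squared, and checks two things: for lightlike but non-collinear $p$ and $p'$ the square equals $-2\,p\cdot p'/[(p\cdot k)(p'\cdot k)]$, and for $p'$ proportional to $p$ it is identically zero. All function names and sample momenta are hypothetical.

```python
def mdot(a, b):
    """Minkowski product with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def eikonal_square(p, pp, k):
    """Square of p/(p.k) - p'/(p'.k) for external momenta p, p' and gluon momentum k."""
    pk, ppk = mdot(p, k), mdot(pp, k)
    v = tuple(a / pk - b / ppk for a, b in zip(p, pp))
    return mdot(v, v)

if __name__ == "__main__":
    k = (0.3, 0.1, -0.2, 0.05)            # sample soft gluon momentum
    p = (1.0, 0.0, 0.0, 1.0)              # lightlike

    # Non-collinear case: reduces to the cross term only, since p^2 = p'^2 = 0.
    p_back = (1.0, 0.0, 0.0, -1.0)
    lhs = eikonal_square(p, p_back, k)
    rhs = -2.0 * mdot(p, p_back) / (mdot(p, k) * mdot(p_back, k))
    print(lhs, rhs)

    # Collinear case: p' = c p makes the two eikonal factors identical.
    p_col = tuple(0.6 * x for x in p)
    print(eikonal_square(p, p_col, k))
```

In the collinear case the two eikonal factors coincide for every $k$, so the integrand vanishes pointwise rather than only after integration.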

For higher-order Feynman diagrams, a systematic method of identifying, regrouping and 
factorizing infrared-sensitive contributions has been developed by Sterman, Libby, Collins, 
and others [47, 48, 49, 55]. As detailed in Section 2.4, this method essentially consists of 
the following steps: 1) simplify the Feynman integrals by setting all the soft scales to zero, 
including the quark masses; 2) identify the regions of loop integration which give rise to infrared 
divergences; 3) use infrared power counting to find the leading infrared-divergent regions; 4) 
show that all soft and collinear divergences either cancel or factorize into some nonperturbative 
quantities. In the remainder of this section, we examine the validity of the factorization formula 
Eq.(3.70) in the limit of $x_B = \xi$ following the above steps. 

In DVCS, the leading contributions come essentially from a massless collinear process in 
which the external momenta take the form shown in Eq.(3.32). In this simplified kinematic 
region, all infrared-sensitive contributions appear as $1/\epsilon$ poles in dimensional regularization. 
If a contribution contains no infrared divergences, it comes from regions of loop momenta 
comparable to the hard scale $Q^2$, and thus is insensitive to the soft scales. An infrared-divergent 
contribution must come from the integration regions where some internal propagators are near 
their mass shells. The parts of these regions which will actually generate infrared divergences 
can be identified from the Landau equations, as we have seen, and characterized by reduced 
diagrams. We will argue below that the general leading reduced diagram for DVCS is the 
one shown in Fig. 3.4a. Here, an incoming virtual photon and an outgoing real one are 
attached to the hard interaction blob, which in turn is connected to the forward nucleon jet 
with two collinear quark lines or two physically polarized gluon lines. An arbitrary number 
of longitudinally polarized collinear gluon lines can also connect the hard diagram to the soft 
one. This diagram is identical to that of DIS. Indeed, the only real difference between DVCS 
and DIS lies in the fact that the final state photon is real. 

Figure 3.4: General reduced diagrams for the DVCS process. 

A real photon can have a pointlike coupling to quarks as well as an extended coupling via 
its soft wavefunction. In this case, one has an additional reduced diagram in which a jet of 
quarks and gluons emerges in the $n^\mu$ direction and combines into a real photon long after the 
hard scattering (Fig. 3.4b). Such a reduced diagram has already been considered in Ref. |5q] 



and is $O(Q^{-1})$ by infrared power counting. Indeed, according to the discussion in Section 




the photon wavefunction vertex has a soft mass dimension $1 = 2$ (quark lines) $- 1$ (photon 
state). A negative hard power ($Q^{-1}$) is needed in $T^{\mu\nu}$ to balance it out. Hence the soft 
coupling of quarks to photons will appear only at next-to-leading twist. 

The only remaining difficulties lie in the soft divergences shown in Figure 3.4c. However, 
since we have already shown that real-photon effects are sub-leading, one can use the same 
arguments here as for DIS. The differences appear only at higher orders in our twist expansion. 
At this point, the only leading reduced diagram is identical to that in DIS. The arguments 
presented in Section 2.4 may also be applied here in almost exactly the same form.^26 The 
eikonal line together with the physical quarks and gluons and the nucleon jets form the 
off-forward parton distributions defined in Section 3.2. In the hard scattering, only the total 
momentum and charge supplied by collinear partons are important. Thus, one can calculate 
it with incoming physical partons carrying the total momentum and charge of all the collinear 
longitudinally-polarized gluons. In this way, we have a complete factorization of the soft and 
hard physics in the DVCS process. 

3.6 Generalized Operator Product Expansion 

and Wilson's Coefficients to the NLO Order 



As we saw in Section 2.5, the calculation of the Compton amplitude can be exploited to 
derive an expansion for the product of two electromagnetic current operators in terms of local 
QCD operators. Since the kinematics of that chapter require identical initial and final states, 
the OPE we derived there is valid only for diagonal matrix elements. Here, however, we 
have considered a somewhat more general case in which certain off-diagonal effects have been 
included. It stands to reason that we can use these results to derive a generalized OPE which 
is valid for these special off-forward matrix elements. 



In our operator derivation of the leading OPE, Equation (2.187), we ignored several 'total 
derivative' operators which cannot contribute to forward matrix elements. By treating the 
initial and final states symmetrically, systematically using $\overleftrightarrow{D}$ in the place of $D$, we will see 
that these total derivative operators still do not contribute to the leading expansion. However, 
at next-to-leading order, they become indispensable as generators of the $\xi$-dependence of our 
amplitude. In the remainder of this section, we will recast our factorization formula in its 
generalized OPE form. In the process, we identify these total derivative operators and obtain 
their Wilson coefficients to next-to-leading order in $\alpha_s$. The final result agrees with the known 
OPE, Eq.(2.208), when diagonal matrix elements are considered. 

To derive the generalized OPE, we expand $T^{\mu\nu}$ in a power series about $x_B = \infty$. In this 



26 The only difference is in the off-forward nature of the distributions relevant to this case. 
way, we can express the amplitude in terms of moments of the parton distributions rather than 
the distributions themselves. Eventually, we will relate the moments of parton distributions 
to the matrix elements of local operators. After the aforementioned expansion, we have 



I*? = -g^Ts - ie a ^n a {2hp)pT A 



OO OO 



n=2 m=0 J \ D / \ D / a=q,g 



q J X \X B J \X B , 

The coefficients for the moments of the quark distributions in the expansion are 
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E- 
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(3.73) 
(3.74) 



(3.75) 



(3.76) 



(3.77) 



as in Section 2.5. The above expansion contains only positive powers of $x$ and $\xi$. This result 
is not immediately obvious because of the $x_+x_-$ denominators in the amplitudes. In the case 
of the gluon distribution functions, we have an additional factor of $x_+x_-$ in the denominator. 
Since the final OPE contains only local operators, these factors have to be canceled in the 
process of expansion. This is indeed the case; the coefficients of the positive moments of the 
gluon distributions are 
2tt 



m m + 2 

n + in n + m + 2 

-25i(n + m- 1) 



1 



m + 2 



m + 2 



- «2?£[2 + 25i(n + m-l)] 



n + m n + m+1 n + m + 2 
m + 1 m + 2 



(3.78) 



27r " \?i + m n + m + 1, 

Having obtained an expansion involving the moments of the distributions, we move toward 
a general form of the OPE. To this end, we consider the moments of the parton distributions. 
As before, these moments can be made into local operators via an appropriate use of integration 
by parts. The result is^27 



dxx n - l F g (x,0 = U P f 



tf> q (0) iV (pi ■ ■ ■ 7^)^(0) Pi) ■ ■ ■ »"» . (3.79) 

Defining 

i° n ^-^ = ^(0) ' ' ' 7^)^,(0), (3.80) 

where (• • •) signifies that the indices are symmetrized and the trace has been removed, we see 



that the replacement 

(n • id) q O+-+ -> 2ZO+-+ 
is valid for the matrix elements we consider. This prompts us to define 



(3.81) 
(3.82) 



After replacing the moments of parton distributions in Eq. (3.74) with matrix elements of these 
operators and interpreting the result as an operator relation, we find the following generalized 
OPE, 



27 As mentioned above, the off-diagonal nature of these matrix elements complicates this process slightly. The 
derivatives that actually appear are $\overleftrightarrow{D} = \frac{1}{2}(\overrightarrow{D} - \overleftarrow{D})$, where the arrow indicates the direction the derivative 
is meant to operate. In DIS, this distinction is unnecessary since $\overleftrightarrow{D} = \overrightarrow{D}$ in this case. 
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3C 



, / o n — m n 

-ie^ qa9 ,xYl E ( (Q2)n ) E eSUm.m.^-^ . (3-83) 

n— 1 m— ^ ' a,—q,g 

where the primed summations once again imply even or odd values only, depending on the 
initial value. It must be pointed out that the generalized OPE does not have a unique form. 
One can define $x_B$ as any nontrivial dimensionless invariant formed from the external momenta
which remains finite in the Bjorken limit and expand the amplitude in inverse powers of this 
variable. This will lead to a different set of coefficient functions, but the physical content is 
the same. The choice of which OPE to use is determined by the specifics of the problem at 
hand. 

Since the operators $O_{n,m}$ are symmetrized and traceless, their rank is $n$. The mass
dimension of these operators is just the dimension of the fields plus that of the derivatives, or
$3 + n - 1 = 4 + n - 2$. Since twist is dimension minus spin, these operators are all twist 2. As we saw in Section 2.5, the
small number of possible twist-2 operators in QCD led to autonomous evolution of the
diagonal matrix elements of our operators in the nonsinglet sector. Here, we see that for a given
spin $n$ there are in fact $n$ independent nonsinglet twist-2 operators with the same quantum
numbers that contribute to our generalized OPE. Since they are not constrained by symmetry
to evolve autonomously, the off-diagonal matrix elements of these operators will mix as the
scale is varied. It is this behavior that leads to the $\xi$ dependence of the off-forward splitting
kernels above.
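
The counting in the paragraph above can be checked mechanically. The following Python sketch assumes a hypothetical labelling in which an operator of total spin $n$ is built from a quark bilinear carrying one $\gamma$ matrix and $k-1$ covariant derivatives, together with $m = n - k$ total derivatives. The function name and this labelling are illustrative only and are not taken from the text.

```python
def twist_two_family(n):
    """List the operators of total spin n: a spin-k bilinear plus m = n - k total derivatives.

    Assumed labelling (illustrative only): each quark field has mass dimension 3/2,
    and every derivative, covariant or total, adds one unit of dimension and of spin.
    """
    family = []
    for m in range(n):                # m = 0, ..., n-1 total derivatives
        k = n - m                     # spin of the underlying bilinear, k >= 1
        dimension = 3 + (k - 1) + m   # two quark fields + (k-1) covariant + m total derivatives
        spin = k + m                  # rank of the symmetrized, traceless tensor
        family.append({
            "total_derivatives": m,
            "dimension": dimension,
            "twist": dimension - spin,
        })
    return family


if __name__ == "__main__":
    for n in (1, 2, 5):
        ops = twist_two_family(n)
        print(n, len(ops), sorted({op["twist"] for op in ops}))
```

Under these assumptions every member of the family has dimension $n+2$ and twist 2, and there are exactly $n$ of them for spin $n$, which is the degeneracy that allows mixing.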

Although Lorentz invariance cannot distinguish between these operators, the larger
symmetry group of conformal invariance can. It is the irreducible representations of this symmetry
group, which are definite combinations of our operators $O_{n,m}$, that diagonalize the
leading-order anomalous dimension matrix. Using this larger symmetry group, one can determine all of
the above generalized Wilson coefficients from the results of Section 2.1 [73]. Unfortunately,
conformal invariance is broken by a quantum anomaly^{28} beyond this order and can no longer
constrain the form of the OPE.



28 Conformal invariance is based on the fact that the (massless) QCD lagrangian contains no scale. However,
dimensional transmutation generates a scale ($\mu^2$) on which parameters can depend. Stated succinctly, the
conformal invariance of QCD is broken by the $\beta$-function.

The above expression contains only the contributions at leading order in $1/Q^2$. At the
next order in $1/Q$ one has to consider operators of higher twist. Some of these operators are
discussed in the next chapter.



3.7 Summary of DVCS 

In this chapter, we have studied the QCD factorization for deeply virtual Compton scattering
explicitly at one loop and then to all orders in perturbation theory. Our conclusion is that
DVCS is factorizable in perturbation theory at leading twist. This statement has the same
level of rigor as that for DIS studied in Chapter 2. The OFPDs probed in this process are
not only interesting in themselves, but can also be manipulated to yield information on the total
spin carried by quarks in the proton. In conjunction with DIS data, we can use this to obtain
the quark orbital contribution to proton spin. This quantity has tremendous implications for
the nature of the proton wavefunction and is not readily obtainable from any other known
process. This makes DVCS a prime candidate for experimental study in the next few years.
Indeed, several experiments are already proposed at Jefferson Lab.

Expanding our one-loop result for general off-forward Compton scattering about $x_B = \infty$,
we were able to obtain a generalization of Wilson's OPE. Our expression is valid at leading 
twist for off-diagonal as well as diagonal matrix elements. This generalization requires the 
introduction of total derivative operators whose forward matrix elements vanish. Our one- 
loop result gives the coefficients of these operators to NLO in QCD. A full generalization 
would also require an analysis of longitudinal photon scattering, which has been done in Ref. 
[ frij , and the photon-helicity flip amplitude [fr6| . 

The scale evolution of total derivative operators can best be studied using conformally
symmetric operators [73]. In fact, it has been known for a long time that at the leading-log level,
operators of the same twist and dimension evolve multiplicatively in Gegenbauer polynomial
combinations. It is a simple exercise to transform Eq. (3.83) into this basis. This fact is a
consequence of conformal symmetry in massless QCD. Although this symmetry is broken by a
quantum anomaly |p0|, it still has implications for the behavior of the OPE. One can actually
obtain the one-loop generalized OPE coefficients from the OPE introduced in Section 2.5 by
exploiting the constraints from conformal symmetry. Unfortunately, the quantum anomaly
causes this method to become uneconomical for higher-order calculations.
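
For readers who want to experiment with the Gegenbauer basis mentioned above, here is a minimal Python sketch that evaluates $C_n^{\lambda}(x)$ from the standard three-term recurrence. The text only says "Gegenbauer polynomial combinations"; the choice $\lambda = 3/2$ used in the example is the index conventionally associated with quark operators and is an assumption here, as is the function name.

```python
def gegenbauer(n, lam, x):
    """Evaluate the Gegenbauer polynomial C_n^lam(x) by upward recurrence.

    Uses C_0 = 1, C_1 = 2*lam*x and
    k*C_k = 2*x*(k + lam - 1)*C_{k-1} - (k + 2*lam - 2)*C_{k-2}.
    """
    if n == 0:
        return 1.0
    prev, curr = 1.0, 2.0 * lam * x
    for k in range(2, n + 1):
        prev, curr = curr, (2.0 * x * (k + lam - 1.0) * curr
                            - (k + 2.0 * lam - 2.0) * prev) / k
    return curr


if __name__ == "__main__":
    # lam = 3/2 is assumed here as the quark-operator index.
    for n in range(4):
        print(n, gegenbauer(n, 1.5, 0.5))
```

A quick sanity check is the closed form $C_2^{3/2}(x) = \tfrac{3}{2}(5x^2 - 1)$ and the endpoint value $C_n^{3/2}(1) = (n+1)(n+2)/2$.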

Measurements of the DVCS amplitude are made difficult by the fact that it competes with
the simple bremsstrahlung process in which the electron deposits the full momentum transfer
$\Delta$ with one virtual photon, emitting the final state photon on its own. Unfortunately, due
to the propagator of the virtual photon in this process, it wins the competition in the region
of small $-t$ relevant to spin physics. However, this competition is a mixed blessing since the
interference between the two amplitudes allows a measurement of the phase associated with
DVCS. Furthermore, in regions where the direct emission dominates, the interference actually
enhances the effect. The Bethe-Heitler amplitude for direct emission is quite well known. Since
the only hadronic physics that enters it is the standard electromagnetic form factors, there should
be no difficulty in performing a direct subtraction to extract the DVCS contribution. Some
estimates of the cross-section in several different kinematical regions are presented in Ref. |}8| .
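
The role of the interference term can be illustrated with plain complex arithmetic. The sketch below is a toy model, not a cross-section calculation: it treats the Bethe-Heitler and DVCS amplitudes as single complex numbers (all names and the sample values are hypothetical) and splits $|T_{BH} + T_{DVCS}|^2$ into its three pieces.

```python
import cmath


def cross_section_terms(bh, dvcs_magnitude, dvcs_phase):
    """Split |bh + dvcs|^2 into |bh|^2, |dvcs|^2 and the interference term.

    bh             -- toy Bethe-Heitler amplitude (complex number)
    dvcs_magnitude -- modulus of the toy DVCS amplitude
    dvcs_phase     -- phase of the toy DVCS amplitude, in radians
    """
    dvcs = cmath.rect(dvcs_magnitude, dvcs_phase)
    interference = 2.0 * (bh.conjugate() * dvcs).real
    return abs(bh) ** 2, abs(dvcs) ** 2, interference


if __name__ == "__main__":
    # Toy numbers: Bethe-Heitler ten times larger than DVCS.
    for phase in (0.0, cmath.pi / 4, cmath.pi / 2):
        print(phase, cross_section_terms(10 + 0j, 1.0, phase))
```

When the direct emission dominates, the interference piece is linear in the small DVCS amplitude and depends on its phase, whereas the pure DVCS piece is quadratic and phase-blind. This is the sense in which the competition is a mixed blessing.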
Chapter 4

The Structure Function $f_T(x,\mu^2)$ - Higher Twist Evolution



Until this point, we have been studying only the leading contributions to scattering processes 
in the Bjorken limit. This led to distribution functions whose moments could be classified 
as twist-2. As we have seen, partons described by twist-2 distributions move collinear to 
the parent hadron. This is what we expect since transverse behavior is suppressed in the 
Bjorken limit. However, a complete understanding of hadrons requires information that is not 
contained in these collinear distribution functions. In particular, transverse momentum effects 
are intimately related to partonic correlations within hadrons. 

A simple example of coherent parton scattering is the interference of a single quark with
a quark and a gluon in a nucleon target. To describe this phenomenon, it is necessary to
introduce a three-parton light-cone correlation function

(4.1)

rather than the usual two-parton functions of the last two chapters. General parton correlations
involve more than one Feynman variable, and hence their scale evolution is more complicated
than those we have studied.

Experimental study of parton correlations is challenging for a number of reasons. One is the
lack of processes in which all Feynman variables in a parton correlation can be kinematically
controlled. For instance, in lepton-nucleon deep-inelastic scattering with transverse polarization,
one can measure the structure function $f_T(x,\mu^2)$. In the Bjorken limit, $f_T(x,\mu^2)$ is related
to a $y$-moment of the above correlation function. Since a moment of $M_\alpha(x,y,\mu^2)$ does not
evolve autonomously, knowing the entire $f_T(x,\mu^2)$ at one scale is not sufficient to calculate it
at another. This makes an analysis of $f_T(x,\mu^2)$ data at different scales difficult.

Several years ago, Ali, Braun, and Hiller (ABH) |3(| made the remarkable discovery that in
the limit of a large number of colors, $N_c$, the twist-3 part of $f_T(x,\mu^2)$ does evolve autonomously
at the one-loop level in the nonsinglet sector. Their result has since been widely used in
model calculations and analyses of experimental data Hj. More recently, similar results
have been found for the evolutions of other twist-3 functions, $h_L(x,\mu^2)$ and $e(x,\mu^2)$ p2[ .
Given the practical importance of the ABH result, a deeper understanding of the large-$N_c$
simplification is clearly desirable. Moreover, it is interesting to investigate the possibility of a
similar simplification at two or more loops and for analogous twist-4 correlations.

In this chapter, I present a direct calculation of the large-$N_c$ evolution of $f_T(x,\mu^2)$ in the
light-cone gauge. This calculation is based on original research by X. Ji and me published in

Section 4.1 is devoted to a discussion of the kinematics and relevant correlations in our
process. In Section 4.2, the details of the calculation are discussed. Here, we will see that the
autonomy of the twist-3 evolution arises from a special property of one particular Feynman
diagram. Since this property is independent of the $\gamma$-matrix structure of the composite
operators inserted, the ABH result generalizes immediately to the twist-3 parts of $h_L(x,\mu^2)$ and
$e(x,\mu^2)$. Unfortunately, for various reasons, it seems there is no similar large-$N_c$ simplification
for twist-4 functions nor for $f_T(x,\mu^2)$ beyond one loop.

A summary of this chapter appears in Section 4.3.



4.1 DIS With Transverse Polarization 

The process we wish to consider here has the same kinematics as that of Chapter 2, but here
we take the proton spin along a transverse direction (either the 1- or 2-direction). Of course,
this means that we cannot take the proton mass strictly to zero because massless fermions do
not have transverse spin components. However, we will ignore this quantity as small whenever
it is subleading.

Since $q\cdot S = 0$, the spin-dependent part of $T^{\mu\nu}$ can be written

TW = -ie^^qaSf,— {S 1 (u, Q 2 ) + S 2 {u, Q 2 )) . (4.2) 

Assuming a factorization theorem for this process, we can express its amplitude as a convolu- 
tion of a perturbative scattering coefficient function with certain nonperturbative distributions. 
As before, we write the quark density matrix^{1}

M aff (x) = J ^e- iXx {PS\^ (Xn)MO)\PS) = \m(n^ + fT(x)M( l5 ^ • ( 4 -3) 

The first term is just the spin-averaged quark distribution function of Chapter 2. Since this 
term is spin-independent, the distinction between transversely polarized targets and longitu- 
dinally polarized targets is meaningless to it. 



In the limit $M \to 0$, the second structure reduces to $h\,\not{p}\,\gamma_5$ as in Section 2.3. However, its
coefficient contains more than just the function we met there. The easiest way to see this is
by examining its moments:

Mx) = -±S a J^e- iX *(PS\^(\n)ri5m\PS) , (4-4) 
dxx^Mx) = -^S a {PS\lp 1 a l5 (id + ) n - 1 *p\PS) . (4.5) 

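The local operators associated to the moments of $f_T(x)$ are very similar to those related to
f(x). In fact, using Lorentz invariance we can write

$$\langle PS|\,\bar\psi\,\gamma^{(\mu_1} i\mathcal{D}^{\mu_2}\cdots i\mathcal{D}^{\mu_n)}\gamma_5\,\psi\,|PS\rangle = 2a_n M S^{(\mu_1}P^{\mu_2}\cdots P^{\mu_n)} ,$$

$$\langle PS|\,\bar\psi\,\gamma^{+} i\mathcal{D}^{+}\cdots i\mathcal{D}^{+}\gamma_5\,\psi\,|PS\rangle = 2a_n M S^{+}P^{+}\cdots P^{+} . \qquad (4.6)$$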

Here, we have symmetrized all of the indices and removed all possible traces to form a twist- 
2 operator. Working with the kinematics of Chapter 2, we can extract the value of a n in 
polarized DIS : 

a n = f dxx^fix) . (4.7) 

Hence only the twist-2 piece of $f_T(x)$ is known from DIS. The remaining twist-3 portion
contains new physics. Let us see how $f_T$ contributes to $T^{\mu\nu}$ at leading order.

1 Here and for the remainder of this chapter, we work in the light-cone gauge $n\cdot A = 0$.

Figure 4.1: Leading diagrams, (a) and (b), for 'naive' scattering with transverse polarization.

The relevant diagrams for quark scattering at leading order are shown in Figure 4.1. The 
result, 

t ih „ ie ^ {MS) 2 A r dxh{x) _JL_ {xp + q)a 5 (4 . 8) 

v J x - x B 

does not satisfy the QED Ward identity, $q_\mu T^{\mu\nu} = 0$, unless $S$ and $p$ are parallel. As usual, the
failure of an amplitude to satisfy Ward identities indicates missing contributions. Here, we are 
being reprimanded for neglecting the contribution due to transverse quark momentum. Our 
quarks are massless, so their spins are aligned with their momenta. To support the transverse 
polarization of the nucleon, quarks must have a transverse component to their momenta. 

Revisiting the diagram of Figure 4.1, we take the incoming quark to have momentum $xp + k_\perp$
and expand the amplitude about $k_\perp = 0$. The expansion of the intermediate propagator is

' — J + 0(kl/Q^) , (4.9) 



xp'+fi+ft± xp'+fi xp'+i xp' + i 

so the leading transverse momentum correction takes the form 

ie\ f dx f d 2 k ± Tr y —L-i}£ ± —L^^ M (x,k ± ) . (4.10) 
q J J [ xp+d xp+i 

The transverse momentum distribution, 

M aP (x,k±) = J2^ PS \^^ X )( X \^\ PS ) S ( n -P x+x - 1 ) 6 (P^ x + k ^ 

x 

= f^^e- iXx e iZ± - k± ( p S\^^)M^)\PS) , (4.11) 

represents the probability of finding a quark with momentum $xp + k_\perp$ in a nucleon of momentum
$P$ and polarization $S$. Turning the factor of $k_\perp$ in our amplitude into a derivative via partial
integration, we arrive at^{2}

Figure 4.2: The gluon correction required for QCD gauge-invariance in DIS with transverse
polarization.



$M_\alpha(x)$ is a new distribution representing the momentum asymmetry induced by the hadron
polarization.

In its present form, $M_\alpha(x)$ is not gauge-invariant. Since transverse momentum cannot be
separated from transversely polarized gauge fields, we must include these effects to achieve
gauge invariance. This implies that the contribution represented by the diagram in Figure 4.2
is necessary. In terms of the distribution

[Equations (4.12)-(4.15) are not legible in this copy.]

where I have defined

$$T^{\mu} = \epsilon^{\mu\nu\lambda\rho} S_\nu p_\lambda n_\rho \qquad (4.16)$$

to introduce the other transverse direction, and

$$\mathcal{D}_\alpha(\mu n) \equiv \partial_\alpha + i g A_\alpha(\mu n) , \qquad (4.17)$$

our full correction can be neatly summed up as

[Equation (4.18) is not legible in this copy.]

2 The $\perp$ on $\gamma^\alpha_\perp$ is to remind us that the sum over $\alpha$ includes only transverse dimensions.

Using the fact that $p^2 = n^2 = 0$, this expression can be further simplified to

[Equation (4.19) is not legible in this copy.]

Substituting (4.13) and using the relations

$$G_1(x,y) = G_1(y,x) , \qquad (4.20)$$
$$G_2(x,y) = -G_2(y,x) , \qquad (4.21)$$

which follow from the reality of our distributions, leads to

[Equation (4.22) is not legible in this copy.]


Apparently, only the first moments of $G_{1,2}(x,y)$ with respect to $y$ enter our process. From
their definitions, one can show that



dy [Gi(x,y) + G 2 {x,y)] 
1 f dX 



M / 2tt 



(PS \ip(Xn)iVl(Xn) (iT„t - S a fi lz ) ^(0)1 PS) 



(4.23) 



Since the expression in parentheses is just one of the many ways to say i "f a fi ^75', the equation 
of motion in this gauge, 

fttfid+ij) = -fLityxip (4.24) 



allows us to write finally^{3}

$$\int dy\,[G_1(x,y) + G_2(x,y)] = x f_T(x) . \qquad (4.25)$$



Adding (4.22) to (4.8), we see that this contribution does indeed cancel the gauge dependence
of our initial result. The complete amplitude,^{4}



T^=ie^q a {MS)p 2 -^- f dx f T (x) (— — 

V J \x — XB X + XB 



(4.26) 



is fully gauge-invariant. In fact, the only difference between transverse and longitudinal
polarization appears to be in the relevant distribution function. From the point of view of the
operator product expansion, this result is obvious. Since the OPE makes no reference to the
polarization of the target, the coefficient of the twist-2 contribution must be the same for both
processes. However, the twist-3 contributions are not constrained in this way. In fact, from
the analysis above it seems like a miracle that only the simple distribution $f_T(x)$ appears in
the final results rather than $G_1(x,y)$ and $G_2(x,y)$. We certainly cannot expect this behavior
to persist in the next order.

3 In order to get rid of the structure yift—fifi, we must assert the reality of our distributions one more time.
4 We can't forget the crossing contribution!

The calculation above is extremely disconcerting. It seems very strange that we had to
consider gluon scattering and transverse momentum effects to obtain QED gauge invariance.
In a fundamental sense, it seems that this result should have been automatic. However, the
Feynman diagram in Figure 4.1 implicitly assumes that we can interpret the scattering within
the parton model. The trouble with a parton model interpretation of $f_T(x)$ can easily be
seen by quantizing the theory on the light front. Rather than the treatment discussed in
Appendix D.1, one can impose canonical commutation relations on the fields and their conjugate
momenta at equal values of $x^+$. In this formalism, the '+' direction becomes the new
'time' variable. For massless field quanta moving in the + direction, this is a very logical
choice for the time axis. In light-front quantization, the quark field contains two very different
components. Defining

$$\psi_\pm = \tfrac{1}{2}\gamma^\mp\gamma^\pm\psi , \qquad (4.27)$$

we see that the equation of motion constrains $\psi_-$:

iMAn) = -J-^ 7iipj_(An)^+(An) . (4.28) 
I in ■ a 

Physically, this means that only $\psi_+$ contains freely propagating^{5} quark quanta. The $\gamma^-$ in the
definition of $\psi_-$ has annihilated all of the free portions of the quark field since these satisfy
$\not{p}\,u(xp) = 0$. Looking at $f_T(x)$,

It{x) = ~ll Sa l ^ 6 ' lXX [ ( p S|?L(An)7 Q 75<M0) |PS) 

+ (PS\^ + (Xn)n^-(0)\PS)] , (4.29) 

we see that it contains both the 'good' component $\psi_+$ and the 'bad' one $\psi_-$.
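
The good/bad decomposition rests on the fact that $\tfrac{1}{2}\gamma^\mp\gamma^\pm$ are complementary projectors. The Python sketch below checks this numerically. It assumes the Dirac representation and the normalization $\gamma^\pm = (\gamma^0 \pm \gamma^3)/\sqrt{2}$, which may differ from the conventions of the appendices; the helper names are illustrative.

```python
def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def combine(ca, a, cb, b):
    """Return the linear combination ca*a + cb*b of two matrices."""
    return [[ca * x + cb * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO = [[0] * 4 for _ in range(4)]

# Dirac representation (assumed): gamma^0 = diag(1, 1, -1, -1),
# gamma^3 = [[0, sigma_3], [-sigma_3, 0]].
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]

# sqrt(2) * gamma^+ and sqrt(2) * gamma^-; the two factors of 1/sqrt(2)
# are restored below as an overall 1/2.
G_PLUS_RAW = combine(1, G0, 1, G3)
G_MINUS_RAW = combine(1, G0, -1, G3)

# P_+ = (1/2) gamma^- gamma^+ and P_- = (1/2) gamma^+ gamma^-.
P_PLUS = [[0.25 * x for x in row] for row in matmul(G_MINUS_RAW, G_PLUS_RAW)]
P_MINUS = [[0.25 * x for x in row] for row in matmul(G_PLUS_RAW, G_MINUS_RAW)]

if __name__ == "__main__":
    print(matmul(P_PLUS, P_PLUS) == P_PLUS)          # idempotent
    print(combine(1, P_PLUS, 1, P_MINUS) == IDENTITY)  # complete
    print(matmul(P_PLUS, P_MINUS) == ZERO)           # orthogonal
```

With these conventions the projectors reduce to $\tfrac{1}{2}(1 \pm \gamma^0\gamma^3)$, so any quark field splits uniquely into $\psi_+ + \psi_-$.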

The external lines in our Feynman diagram represent free quark fields;^{6} in order to
accurately represent $f_T(x)$ with a diagram, we must first eliminate all of the bad quark components
in favor of the good ones.^{7} This process turns Figure 4.1 into Figure 4.3. An explicit
calculation shows that the contribution from the graphs of Figure 4.3 is identical to (4.8).^{8} However,
from Fig. 4.3 it is quite clear that the contributions of Figure 4.2 are necessary to ensure
QED gauge invariance. The point here is not that (4.8) is wrong, just incomplete. The form
of (4.4) misleads us into thinking that it is the only contribution; the more honest form (4.29)
reveals the true nature of transverse scattering to be inextricably coupled with multiparton
correlation effects. We will see that the effect of this coupling has consequences far greater
than the inclusion of a few more diagrams.

5 in the + direction

6 Our Feynman rules have been derived in this way; see Section 1.2.

Figure 4.3: These one-particle reducible diagrams represent the true nature of Figure 4.1. They
are obtained by eliminating the 'bad' component of $\psi$ via the light-cone equations of motion.



4.2 The Evolution Equation 

Now that we have seen that transverse polarization cannot be decoupled from coherent partonic 
correlations, it is natural to ask what kind of effect this has on the analysis of the previous 
chapters. There, we were able to factorize the DIS cross-section into soft and hard sectors due 
to the incoherence of the dominant scattering mechanism. It is clear that the introduction of 
correlation effects will have some kind of an impact on this analysis, but its full consequences 
are not obvious. 

One immediate effect of coherent scattering is a complication of the soft sector of our
amplitude. If a factorization theorem does indeed exist at this level, the matrix elements
representing the soft physics certainly must encode a great deal more information than those
previously studied. The distribution in Equation (4.4) does not look much different from those
we met in Section 2.3, but we have just seen that this form can be quite misleading.

7 Note that this requirement is quite independent of QCD. We would have to perform this manipulation even
in a theory without local gauge invariance, where it would lead only to the transverse momentum correction.

8 Strictly speaking, these gluons must be combined with the related transverse momentum insertion to
reproduce (4.8). For this reason, it is often advisable to begin with bad components and eliminate them via
(Eii).

A reliable way to see how much information a distribution contains is by studying its scale 
evolution. For a self-contained distribution which evolves autonomously, only one measurement 
is enough to determine the complete behavior. On the other hand, distributions which mix 
with each other require external inputs for a study of their scale dependence. As we have seen, 
mixing is an indication that a distribution does not contain all of the relevant physics. 
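
The distinction between autonomous evolution and mixing can be made concrete with a toy model. The Python sketch below evolves two "matrix elements" with a made-up $2\times 2$ anomalous dimension matrix at fixed coupling; the matrix entries, the function names, and the simple Euler integrator are all assumptions chosen for illustration and carry no physical input from the text.

```python
def evolve(theta, gamma, t, steps=10000):
    """Integrate d(theta)/dt = gamma . theta with explicit Euler steps.

    t plays the role of ln(mu^2); gamma is a toy anomalous dimension matrix.
    """
    h = t / steps
    theta = list(theta)
    size = len(theta)
    for _ in range(steps):
        rate = [sum(gamma[i][j] * theta[j] for j in range(size))
                for i in range(size)]
        theta = [theta[i] + h * rate[i] for i in range(size)]
    return theta


# Toy matrix, for illustration only.
GAMMA = [[-1.0, 0.5],
         [0.0, -2.0]]

# Left eigenvector of GAMMA with eigenvalue -1: C . GAMMA = -1 * C.
C = [1.0, 0.5]


def combo(theta):
    """The special linear combination that evolves on its own."""
    return C[0] * theta[0] + C[1] * theta[1]


if __name__ == "__main__":
    # Two different starting points with the same value of the combination.
    a = evolve([1.0, 0.0], GAMMA, 1.0)
    b = evolve([0.0, 2.0], GAMMA, 1.0)
    print("combination:", combo(a), combo(b))
    print("first component:", a[0], b[0])
```

Two initial conditions that agree on the special combination still agree on it after evolution, so one measurement fixes its behavior. The first component alone does not have this property: its evolved value depends on the unmeasured second component.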

Since operators of different twist do not belong to the same representation of the Lorentz
group, we must separate the twist-2 and -3 contributions to $f_T(x,\mu^2)$ in order to study its
evolution effectively. This is most easily done with the local operators that constitute its
moments rather than the function itself. The $(n+1)$-st moment of $f_T(x,\mu^2)$ is written

J dxx n f T (x,^) = ^n^---n, n ( x Ps\9^-^\ps) , (4.30) 

where

$$\theta^{\sigma(\mu_1\cdots\mu_n)} = \bar\psi\,\gamma^\sigma\gamma_5\, i\mathcal{D}^{(\mu_1}\cdots i\mathcal{D}^{\mu_n)}\psi , \qquad (4.31)$$

with $(\mu_1\cdots\mu_n)$ indicating symmetrization of the indices and removal of the traces. The twist-2
$\theta^{(\sigma\mu_1\cdots\mu_n)}$ (totally symmetric and traceless) and twist-3 $\theta^{[\sigma(\mu_1]\mu_2\cdots\mu_n)}$ (mixed symmetric and
traceless) contributions, where $[\sigma\mu_1]$ denotes antisymmetrization, can now be separated quite
easily via

where the ellipses denote terms of higher twist.

As mentioned above, the twist-2 part of $\theta_n$ is identical to the operator related to the
$(n+1)$st moment of f(x). Its evolution has already been studied in Section 2.5. We argued
in that section that the twist-2 nonsinglet quark operators must have autonomous evolution
because there are simply no operators with which they can mix. This statement is not true at
the twist-3 level. A complete basis of twist-3 quark operators was first identified in [84],

R? = iiW^ 1 ■ ■ ■W^- 1 (-ig)T' J ^W^+ 1 ■ ■ ■ W^- 1 ^^ 

= ^W^ 1 ■ --W^- 1 gP^HV^ ■ ■ . iDMn-iyO ^ , (4.33) 
where $i = 1, \ldots, n-1$. The operator $\theta^{[\sigma(\mu_1]\mu_2\cdots\mu_n)}$ is just a special linear combination of them:



1 n— 1 

J2(n-i)(Ri-K-i + Si ■ (4.34) 



" 3 ~ 4n 

1=1 

The anomalous dimension matrix in the above operator basis was first worked out by
Bukhvostov et al. and later reproduced by a number of authors with different methods [85]. The
result is what one would generally expect. To evolve the matrix element of $\theta^{[\sigma(\mu_1]\mu_2\cdots\mu_n)}$, it
is not enough just to know it at an initial scale - one must know all the matrix elements of
$W_i^n = R_i^n - R_{n-i}^n + S_i^n + S_{n-i}^n$ there. Important physical information pertaining to the soft
physics of our process is simply not probed by the experiment. This is actually quite clear from
the discussion in the last section. There, we saw that the distributions $G_1(x,y)$ and $G_2(x,y)$
are in some sense more fundamental than $f_T(x)$. The local operators $R_i^n$ and $S_i^n$ are related
to the moments of these distributions.

By studying the anomalous dimension matrix in the large-$N_c$ limit,^{9} Ali, Braun and Hiller
found that the eigenvector corresponding to the lowest eigenvalue is just the linear combination
of twist-3 operators on the right-hand side of Eq. (4.34). In other words, the twist-3 part of
$f_T(x,\mu^2)$ evolves autonomously in this limit. To better understand ABH's result, we calculate
the large-$N_c$ evolution of $f_T(x,\mu^2)$ directly. Starting with the mixed-twist operator $\theta^{\sigma(\mu_1\mu_2\cdots\mu_n)}$
in Eq. (4.30), we look for possible divergences when inserted in multi-point Green's functions.
To reduce the number of Feynman diagrams, we choose the light-cone gauge $A^+ = 0$ and take
the $\perp{+}\cdots{+}$ component of the $\theta$-operator. The resulting operator,

$$\theta_n \equiv \theta^{\perp+\cdots+} = \bar\psi\,\gamma^\perp\gamma_5\,(i\partial^+)^n\,\psi , \qquad (4.35)$$

has the simple Feynman rule $\gamma^\perp\gamma_5 (k^+)^n$, where $k$ is the momentum of the incoming quark.

By light-cone power counting, we need only consider two- and three-point functions. Since
the external lines carry color, we must ask what type of diagrams dominate the large-$N_c$ limit.
As shown in Appendix C, the simple rule is that when all external lines are drawn to one
point (infinity) the planar diagrams are leading. All one-particle-irreducible (1PI) leading two-
and three-point diagrams with one $\theta$ insertion are shown in Fig. 4.4.

Figure 4.4: The (a) two- and (b) three-point 1PI Feynman diagrams contributing to the
evolution of $\theta_n$ in the large-$N_c$ limit.

9 Large-$N_c$ physics is discussed in Appendix C.

The ultraviolet divergences in the two-point Green's function can obviously be subtracted
with the matrix element of $\theta_n$ itself. The only diagram in which the divergences may not be
subtracted by $\theta_n$ is Fig. 4.4b. An explicit calculation shows that the ultraviolet divergences
here require a subtraction of the local operator


1 g 2 2 ( u 2 e<* 



-e/2 



1 n— 1 

(n + 2) ■* — 



i=0 



+ ( E y - ^TT) ^ 1 ^ ^7 5 (ia+)"-V + ^9 + r~ V75 ^ * 



,(4.36) 



2"" 1 8tt 2 e V 4tt/^ 

1 

"3(71+1) 

where I have neglected the contributions of light-cone singularities which will be canceled 
eventually. Notice that the first term is present in the twist-2 operator

^2 = ^ (^ 7 s 5 ^ + rv'+|]^7 + 75^ + rii? ± ( l a + r-' i -v) , (4.37) 



and the remaining two terms can be converted to $\theta_n$ by using the equation of motion (4.28).
Thus we easily arrive at the ABH conclusion that $\theta_{n3}$ evolves autonomously in the large-$N_c$
limit.

Including the contribution from Fig. 4.4a as well as the wavefunction renormalization, we 
obtain the evolution equation 

/ n+l 



2 dO n _ a sip 2 ) N c 
1 dp 2 ~ 



2tt 2 



n + l 
( 

n + 2 



1 1 



1 



i n+l 2 



(4.38) 



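Separating out the twist-2 and twist-3 parts, we not only recover the well-known twist-2
evolution,^{10}

$$\mu^2\frac{d\theta_{n2}}{d\mu^2} = \frac{\alpha_s(\mu^2)}{2\pi}\,\frac{N_c}{2}\left(\frac{3}{2} + \frac{1}{(n+1)(n+2)} - 2\sum_{i=1}^{n+1}\frac{1}{i}\right)\theta_{n2} , \qquad (4.39)$$

but also the twist-3 result

$$\mu^2\frac{d\theta_{n3}}{d\mu^2} = \frac{\alpha_s(\mu^2)}{2\pi}\,\frac{N_c}{2}\left(-2\sum_{i=1}^{n+1}\frac{1}{i} + \frac{1}{n+1} + \frac{1}{2}\right)\theta_{n3} , \qquad (4.40)$$

10 Note that this is the evolution of the $(n+1)$st moment of the distribution rather than the $n$th.

which is identical to the result in Ref. |3£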
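
As a numerical cross-check on the twist-2 side, the following Python sketch evaluates the standard leading-order nonsinglet moment $\tfrac{3}{2} + \tfrac{1}{N(N+1)} - 2\sum_{i=1}^{N} \tfrac{1}{i}$ and multiplies it by the large-$N_c$ color factor $N_c/2$. It assumes that this textbook formula, with $C_F$ replaced by $N_c/2$, is what the "well-known twist-2 evolution" refers to; the function names and the sample value of $\alpha_s$ are hypothetical.

```python
import math
from fractions import Fraction


def harmonic(count):
    """Return the harmonic sum 1 + 1/2 + ... + 1/count as an exact fraction."""
    return sum((Fraction(1, i) for i in range(1, count + 1)), Fraction(0))


def nonsinglet_moment(big_n):
    """Bracket of the leading-order nonsinglet evolution for the N-th moment."""
    return (Fraction(3, 2) + Fraction(1, big_n * (big_n + 1))
            - 2 * harmonic(big_n))


def large_nc_rate(n, alpha_s, n_c=3):
    """Logarithmic rate d ln(theta_n2) / d ln(mu^2) with C_F -> N_c / 2.

    theta_n carries n derivatives, so it corresponds to the (n+1)-st moment.
    """
    return alpha_s / (2.0 * math.pi) * (n_c / 2.0) * float(nonsinglet_moment(n + 1))


if __name__ == "__main__":
    for n in range(4):
        print(n, nonsinglet_moment(n + 1), large_nc_rate(n, alpha_s=0.3))
```

The first moment has vanishing rate, as required by conservation of the corresponding current, and all higher moments decrease with increasing scale.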



It is quite clear that the $i$-independence of the coefficients in the sum in Eq. (4.36) is the key
for the autonomous evolution of $\theta_{n3}$. This property is not totally unexpected if one inspects
Fig. 4.4b more closely. Interpreting this diagram in coordinate space, we see that the internal
gluon propagates homogeneously from one quark to the other. By homogeneously, we mean
that at any point along the path of the propagation, the gluon behaves exactly the same way,
except, of course, at the points where the gluon and quarks interact. The spatial location of the
interaction with the external gluon determines the number of derivatives before and after the
gluon field in the subtraction operator. Since the internal gluon propagation is homogeneous,
different locations of the triple-gluon vertex produce similar physical effects. Therefore, the
coefficients of the subtraction operators $\bar\psi\,\not{n}\,\gamma_5 (i\partial^+)^i\, i\mathcal{D}_\perp (i\partial^+)^{n-1-i}\psi$ will be independent of
$i$. The two extra terms in Eq. (4.36) correspond to the triple-gluon vertex just next to the
external quark lines, where the homogeneity is lost. However, these diagrams cannot break
the symmetry since they can always be discarded in favor of the 'bad' quark component via
the equation of motion (4.28).

Since the homogeneous property of the internal gluon line is independent of the $\gamma$-matrix
structure of the operator inserted, we conclude that the other twist-3 distributions $e(x,\mu^2)$
and $h_L(x,\mu^2)$^{11} also evolve autonomously in the large-$N_c$ limit. A quick calculation confirms
the evolution equations found in Ref. [ §2| .

Encouraged by the success of the above approach, we apply it to the analogous twist-4
evolution. In Ref. [86], the three one-variable distributions $f_4(x,\mu^2)$, $g_3(x,\mu^2)$ and $h_3(x,\mu^2)$
are shown to contain twist-4. For example, $f_4(x)$ is defined as

It was shown in Ref. [87] that $f_4(x,Q^2)$ contributes to the $1/Q^2$ term of the longitudinal
scaling function $F_L$ of the nucleon

where we have neglected higher-order radiative corrections. Here, autonomous evolution of
$f_4(x,\mu^2)$ would simplify the analysis of $F_L$ data immensely.

11 These distributions are defined in

Figure 4.5: Four-point 1PI Feynman diagrams contributing to the evolution of $O$ in the
large-$N_c$ limit.

In the large-$N_c$ limit, we consider one insertion of the operator $O = \bar\psi\,\gamma^-(i\partial^+)^n\,\psi$ into
two-, three- and four-point Green's functions. At one-loop order, the 1PI two- and three-point
diagrams are identical to those in Fig. 4.4 and the 1PI four-point diagrams are shown in Fig.
4.5. Only the three- and four-point diagrams can potentially destroy the autonomous evolution
of $O$. Let us start with Fig. 4.5a. One of its divergent contributions introduces the local
subtraction

Y^~^iy>^d + ) l iy>±(id + ) n - l - 2 il> + h.c. , (4.43) 

i 

where the coefficients are independent of i because of the homogeneity of the gluon propagator. 
Using the equation of motion, we can write this as 

J2i>(id + Ti y>±{id + ) n - l - 2 ijj + h.c. (4.44) 

i 

Since this operator cannot be reduced to either the twist-2 or twist-4 part of $O$, the evolution
of the latter cannot be autonomous unless this contribution is canceled by other diagrams. The
only other diagram containing the same divergence structure is Fig. 4.4b. Unfortunately, our
explicit calculation did not produce this cancelation. Here, it is not the inhomogeneity of gluon
propagation which destroys the autonomous evolution. In the twist-4 sector, there are simply
more operators which do not distinguish between quark fields (are derived from homogeneous
propagation) than appear in the decomposition of $O$ into twist-2 and -4 contributions. The
same phenomenon occurs for the twist-4 part of $g_3(x,\mu^2)$ and $h_3(x,\mu^2)$.

Figure 4.6: Some two-loop 1PI Feynman diagrams, (a) and (b), that might break the autonomy of the $\theta_n$
in the large-$N_c$ limit.

Thus, the large- iV c simplification seems to happen only for the evolution of the twist-3 part 
of Jt(x,h 2 ), fiL(x,[i 2 ) and e(x,[i 2 ). Does it happen for them at two and higher loops? In 
Fig. 4.6, we see two examples of Feynman diagrams suspected to break the autonomy of the 
8 n 3 evolution, i.e. they may contain divergences that cannot be subtracted by 8 n2 and 9 n s 
only. The inhomogeneity of the gluon propagator leads to this suspicion. The internal gluon 
that propagates from one quark to another has different wavelengths in the different parts of 
the propagation. Its interaction with the external gluon will therefore be different at different 
spatial locations. Thus the subtraction operators have different coefficients depending on the 
number of derivatives before and after the external gluon field. An explicit calculation of Fig. 
4.6a confirms this suspicion. 

This leaves us with only one possibility for autonomous two-loop evolution of $\theta_{n3}$: the
unwanted structures cancel in the sum of all large-$N_c$ two-loop diagrams. Calculating all those
diagrams is a big task. However, even without an explicit calculation, we do not expect the
cancelation to happen. The fundamental reason is that the large-$N_c$ limit represents only
a selection of a subset of Feynman diagrams, whereas the result of an individual diagram is 
independent of this limit. Cancelations of a structure do not generally happen among Feynman 
diagrams unless there is a symmetry. 
4.3 Conclusions



Although deep-inelastic scattering with transversely polarized targets seems quite similar to 
that with longitudinal polarization, a closer look reveals it to be quite deeply entangled with 
coherent scattering effects. This entanglement stems from the fact that transverse polarization 
induces a transverse momentum distribution which cannot be decoupled from the effects of 
physical gluons. 

A study of the evolution of the distribution function relevant to this process unearths a 
large number of coupled operators which have been obscured from view by the equations of 
motion. These operators produce an extremely complicated evolution for the twist-3 part of 
the measured structure function, with the effect that scale variation of the distribution function
cannot be performed until one knows the full functional form of the more fundamental distributions
$G_1(x,y)$ and $G_2(x,y)$. Unfortunately, these distributions are not readily extracted
separately in experiment. Only the combination appearing in $g_T(x)$ shows up at leading order
in DIS.

A calculation of Ali, Braun, and Hiller shows that the anomalous dimension matrix of
these operators simplifies considerably in the limit of a large number of colors. The result of
this simplification is that the twist-3 part of $g_T(x)$ evolves autonomously at one loop after all.
Corrections to this evolution appear at $O(1/N_c^2)$, so it should be trusted to approximately 10%.
This amazing result represents an incredible simplification of the data analysis on polarized DIS 
and allows us to extract valuable information on gluon-quark correlations within the proton. 

This simplification naturally leads us to ask if there are any other cases in which it occurs. 
Later calculations have shown that similar simplifications do indeed occur for the twist-3 parts 
of other distributions relevant to experiment. An extension of the method used by Ali, Braun, 
and Hiller to twist-4 and higher or to two or more loops would be a heroic undertaking. Here, 
we have shown how to avoid a calculation of the entire anomalous dimension matrix and get 
their result directly. Furthermore, using this method, one can see clearly why the simplification 
occurs. 

We have seen that the large-$N_c$ simplification happens because this limit, quite accidentally
it seems, happens to throw out exactly the diagrams which induce mixing. In the light-cone
gauge, the simplification can easily be traced to a special property of Fig. 4.4b. Unfortunately,
this special property is not enough to ensure autonomous evolution of the analogous twist-4
distributions at one loop, and does not persist to higher orders in the twist-3 case. Nonetheless,
the discovery of Ali, Braun, and Hiller remains a significant step forward in the study of
the $g_T(x,\mu^2)$ structure function. Without the autonomous one-loop evolution, an analysis of
experimental data on the twist-3 contribution would be severely constrained. 
Appendix A



Standard Conventions and Formulae 



This Appendix is meant to provide a table of standard formulae and conventions used in the 
text. 

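We work in 'God-given' units, where $c = \hbar = 1$.
Throughout, I use the metric

$g^{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$ (A.1)

of signature -2.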

The Levi-Civita symbol is defined via $\epsilon^{0123} = +1$.

When written explicitly, I use the convention that the time component of a 4-vector goes
first, followed by space. For example, $p^\mu = (p^0, p^1, p^2, p^3)$.

Indices in Greek at the beginning of the alphabet ($\alpha, \beta, \ldots$) are usually Dirac indices.
Exception: $\alpha$ and $\beta$ are fair game for use as Lorentz indices.

Indices in Greek in the middle of the alphabet ($\mu, \nu, \ldots$) are usually Lorentz indices.

Indices in Latin at the beginning of the alphabet (a, b, ...) are usually in the adjoint
representation of SU(3).

Indices in Latin in the middle of the alphabet are usually either spatial indices or
in the fundamental or conjugate representation of SU(3).

Context is really the final word here. If it should be a Lorentz index, it probably is...
The summation convention is everywhere in effect (unless specified otherwise). This means
that any repeated index in the same term of an expression is meant to be summed over all
possible values of that index. In certain spaces, invariant summing requires one contravariant
(up) and one covariant (down) index.

Suppression of indices implies contraction in the most obvious way. Either last index with 
first index or covariant index with contravariant (read from left to right). Exception : for 
double contraction on two indices my convention is 6^ J^ v = 9J. 

The difference between the index $i$ and the $\sqrt{-1} = i$ should be clear from context. I have
tried to avoid grossly misleading presentations, but borderline cases may be present.

[ , ] denotes the commutator, defined by [A, B] = AB - BA.

{ , } denotes the anticommutator, defined by {A, B} = AB + BA.

The symbol ~ is used to indicate either approximate equality or equality in all important 
respects. In most cases, it can be read, "is essentially the same as" . 
The conjugation operators I employ are
$*$ for complex,
$\dagger$ for Hermitian,
an overbar for Dirac.
The notation $\slashed{a}$ is shorthand for $a_\mu\gamma^\mu$.

The absence of an explicit matrix structure where one should be indicates the presence of 
the identity. 

The absence of integration limits implies integration over the entire relevant space. Some- 
times, integration is arrested by the support of its integrand. 

I do not differentiate explicitly between scalars and 4-vectors. This should be clear from
the context. Spatial 3-vectors always appear with vector hats, i.e. $\vec{v}$.

I do not specify the number of $\delta$-functions appearing in an expression explicitly unless this
is not obvious from the context. For example, $\delta(p - p') = \delta(p^1 - p'^1)\,\delta(p^2 - p'^2)\,\delta(p^3 - p'^3)$.
One important exception is the volume of spacetime, $(2\pi)^3\delta^{(3)}(0)$.
A.1 Dirac Algebra

The $\gamma$-matrices are $d$ $2^{d/2} \times 2^{d/2}$-dimensional operators on the $2^{d/2}$-dimensional Dirac spinor
space defined by the Clifford algebra

$\{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu}\,\mathbb{1}$ , (A.2)

where $g^{\mu\nu}$ is the metric tensor of our space. In four dimensions, one defines

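$\gamma_5 = i\gamma^0\gamma^1\gamma^2\gamma^3 = -\frac{i}{4!}\,\epsilon^{\mu\nu\lambda\rho}\gamma_\mu\gamma_\nu\gamma_\lambda\gamma_\rho$ (A.3)

so that the projection operators

$P_R = \frac{1}{2}(1 + \gamma_5)$ , $P_L = \frac{1}{2}(1 - \gamma_5)$ (A.4)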

project out positive (right-handed) and negative (left-handed) helicity states, respectively. The 
Weyl basis is defined by the choice 

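$\gamma^\mu = \begin{pmatrix} 0 & \sigma^\mu \\ \bar\sigma^\mu & 0 \end{pmatrix}$ , (A.5)

where $\sigma^i$ are the Pauli matrices,

$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$ , $\sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$ , $\sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$ , (A.6)

$\sigma^0$ is the 2x2 identity, and

$\bar\sigma^\mu = (\sigma^0, -\sigma^i)$ . (A.7)

In this basis, $\gamma_5$ takes the form

$\gamma_5 = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$ , (A.8)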

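where 1 represents the 2x2 identity. The generators of Lorentz transformations,

$\sigma^{\mu\nu} = \frac{i}{2}\,[\gamma^\mu, \gamma^\nu]$ , (A.9)

take the form

$\sigma^{0i} = -i\begin{pmatrix} \sigma^i & 0 \\ 0 & -\sigma^i \end{pmatrix}$ (A.10)

$\sigma^{ij} = \epsilon^{ijk}\begin{pmatrix} \sigma^k & 0 \\ 0 & \sigma^k \end{pmatrix}$ . (A.11)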
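As a cross-check of the Weyl-basis expressions, the following Python sketch builds the matrices numerically and tests the Clifford algebra. It assumes that $\gamma^\mu$ carries $\sigma^\mu$ in the upper-right block and $\bar\sigma^\mu = (\sigma^0, -\sigma^i)$ in the lower-left block; all helper names are illustrative and not part of the text.

```python
# Pure-Python 2x2 / 4x4 complex matrix helpers (no external libraries).
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b, sign=1):
    """Return a + sign * b, element by element."""
    return [[x + sign * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def block(tl, tr, bl, br):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return [tl[0] + tr[0], tl[1] + tr[1], bl[0] + br[0], bl[1] + br[1]]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

ZERO2 = [[0, 0], [0, 0]]
ID2 = [[1, 0], [0, 1]]
PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

def neg(m):
    return scale(-1, m)

# Assumed Weyl basis: gamma^mu = [[0, sigma^mu], [sigma-bar^mu, 0]].
sigma = [ID2] + PAULI
sigma_bar = [ID2] + [neg(s) for s in PAULI]
gamma = [block(ZERO2, sigma[m], sigma_bar[m], ZERO2) for m in range(4)]

METRIC = [1, -1, -1, -1]          # diagonal of g^{mu nu}
ID4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

def clifford_ok():
    """Check {gamma^mu, gamma^nu} = 2 g^{mu nu} for all index pairs."""
    for m in range(4):
        for n in range(4):
            anti = add(matmul(gamma[m], gamma[n]), matmul(gamma[n], gamma[m]))
            expected = scale(2 * METRIC[m] if m == n else 0, ID4)
            if not close(anti, expected):
                return False
    return True

# gamma_5 = i gamma^0 gamma^1 gamma^2 gamma^3
gamma5 = scale(1j, matmul(matmul(gamma[0], gamma[1]), matmul(gamma[2], gamma[3])))
```

Under these assumptions `clifford_ok()` holds and `gamma5` comes out block-diagonal, with $-1$ in the upper block and $+1$ in the lower block.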
Under Hermitian conjugation, one has

$(\gamma^\mu)^\dagger = \gamma^0\gamma^\mu\gamma^0$ (A.12)

$(\sigma^{\mu\nu})^\dagger = \gamma^0\sigma^{\mu\nu}\gamma^0$ (A.13)

$\gamma_5^\dagger = \gamma_5 = -\gamma^0\gamma_5\gamma^0$ . (A.14)

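These matrices are linearly independent and, along with the identity and $\gamma^\mu\gamma_5$, form a
closed set under multiplication in d = 4. The relations

$\gamma_5^2 = 1$ (A.15)

$\sigma^{\mu\nu}\gamma_5 = \frac{i}{2}\,\epsilon^{\mu\nu\lambda\rho}\sigma_{\lambda\rho}$ (A.16)

$\gamma^\mu\gamma^\nu = g^{\mu\nu} - i\sigma^{\mu\nu}$ (A.17)

$\gamma^\mu\gamma^\nu\gamma^\lambda = g^{\mu\nu}\gamma^\lambda - g^{\mu\lambda}\gamma^\nu + g^{\nu\lambda}\gamma^\mu + i\epsilon^{\mu\nu\lambda\rho}\gamma_\rho\gamma_5$ (A.18)

$\gamma^\mu\gamma^\nu\gamma^\lambda\gamma^\rho = g^{\mu\nu}g^{\lambda\rho} - g^{\mu\lambda}g^{\nu\rho} + g^{\nu\lambda}g^{\mu\rho}$
$\quad - i\left(g^{\mu\nu}\sigma^{\lambda\rho} - g^{\mu\lambda}\sigma^{\nu\rho} + g^{\nu\lambda}\sigma^{\mu\rho} + g^{\lambda\rho}\sigma^{\mu\nu} - g^{\nu\rho}\sigma^{\mu\lambda} + g^{\mu\rho}\sigma^{\nu\lambda}\right) - i\epsilon^{\mu\nu\lambda\rho}\gamma_5$ (A.19)

simplify the decomposition. In d dimensions, the identities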

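$\gamma^\lambda\gamma^\mu\gamma_\lambda = (2-d)\gamma^\mu$ (A.20)

$\gamma^\lambda\gamma^\mu\gamma^\nu\gamma_\lambda = 2\gamma^\nu\gamma^\mu + (d-2)\gamma^\mu\gamma^\nu = 4g^{\mu\nu} + (d-4)\gamma^\mu\gamma^\nu$ (A.21)

$\gamma^\lambda\sigma^{\mu\nu}\gamma_\lambda = (d-4)\sigma^{\mu\nu}$ (A.22)

$\gamma^\lambda\gamma^\mu\gamma^\nu\gamma^\rho\gamma_\lambda = -2\gamma^\rho\gamma^\nu\gamma^\mu + (4-d)\gamma^\mu\gamma^\nu\gamma^\rho$ (A.23)

$\gamma^\lambda\sigma^{\mu\nu}\gamma^\alpha\gamma_\lambda = 2\gamma^\alpha\sigma^{\mu\nu} + (4-d)\sigma^{\mu\nu}\gamma^\alpha$ (A.24)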
7 * 7 Miy*2 . . .y*n 7 ^ _ 27 A '"7 A ' 1 7 A ' 2 • • • 7**n-i 

+ (-1)™ ^'VW 4 ' ' ' 7 M " + (4 - d)^ 1 ^ 2 ■ ■ ■ 7^] (A.25) 

n-3 

+ 2^(-l)V"-* 7 W ... 7 Mn-i-l 7 ^n- 4 +l ... 7 Mn . n>3 

i=l 

prove useful in reducing products of $\gamma$-matrices.

In Feynman diagrams, one is often faced with traces of $\gamma$-matrices. Due to the fact that
the identity is the only matrix in our space whose trace is nonzero, a nonzero trace implies
that for each matrix $\gamma^\mu$ in the trace there is another matrix $\gamma^\nu$ such that $\gamma^\mu\gamma^\nu$ is proportional
to the identity. With this in mind, one can show that the trace formulae

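$\mathrm{Tr}[\gamma^\mu\gamma^\nu] = 4g^{\mu\nu}$ (A.26)

$\mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\lambda\gamma^\rho] = 4\left(g^{\mu\nu}g^{\lambda\rho} - g^{\mu\lambda}g^{\nu\rho} + g^{\mu\rho}g^{\nu\lambda}\right)$ (A.27)

$\mathrm{Tr}[\gamma^{\mu_1}\cdots\gamma^{\mu_n}] = 0$ , n odd (A.28)

$\mathrm{Tr}[\gamma^{\mu_1}\cdots\gamma^{\mu_n}] = \mathrm{Tr}[\gamma^{\mu_n}\cdots\gamma^{\mu_1}]$ (A.29)

$\mathrm{Tr}[\slashed{a}\gamma^\alpha\slashed{b}\gamma^\beta\slashed{c}\gamma_\alpha\slashed{d}\gamma_\beta] = (d-2)\left\{(4-d)\,\mathrm{Tr}[\slashed{a}\slashed{b}\slashed{c}\slashed{d}] - 16\,(a\cdot c)(b\cdot d)\right\}$ (A.30)

$\quad = 4(2-d)\left\{(8-d)(a\cdot c)(b\cdot d) + (d-4)\left[(a\cdot b)(c\cdot d) + (a\cdot d)(b\cdot c)\right]\right\}$ (A.31)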

TrlMf-ft-ft-lai-lfi] = (d-4)(6-d)TH*JWM 

+4Tr{^[(4-d)(p-fc (A.32) 

-2p-g #]} 

are valid in d dimensions.[1] In general, the trace of an even number of $\gamma$-matrices is 4 times the
sum over all different pairings of indices multiplied by the sign of the permutation. Written in
this way, the trace of 2n $\gamma$-matrices has $(2n)!/(2^n n!)$ terms.
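The pairing rule can be made concrete with a short Python sketch. It is only an illustration of the rule as stated, assuming the diagonal metric of signature -2 and a trace of the identity equal to 4; the function names are hypothetical.

```python
from math import factorial

def pairings(positions):
    """Yield (sign, pairs) for every way of pairing up the given positions."""
    if not positions:
        yield 1, []
        return
    first, rest = positions[0], positions[1:]
    for k, partner in enumerate(rest):
        # Bringing `partner` next to `first` costs k transpositions.
        sign = -1 if k % 2 else 1
        remaining = rest[:k] + rest[k + 1:]
        for sub_sign, sub_pairs in pairings(remaining):
            yield sign * sub_sign, [(first, partner)] + sub_pairs

def trace_of_gammas(lorentz_indices, metric=(1, -1, -1, -1)):
    """Tr[gamma^{mu_1} ... gamma^{mu_k}] as 4 * (signed sum over pairings)."""
    if len(lorentz_indices) % 2:
        return 0                      # odd traces vanish
    total = 0
    for sign, pairs in pairings(list(range(len(lorentz_indices)))):
        term = sign
        for i, j in pairs:
            mu, nu = lorentz_indices[i], lorentz_indices[j]
            term *= metric[mu] if mu == nu else 0   # diagonal g^{mu nu}
        total += term
    return 4 * total

def number_of_pairings(n):
    """Number of terms in the trace of 2n gamma matrices."""
    return factorial(2 * n) // (2 ** n * factorial(n))
```

For four matrices the three pairings carry the signs $+$, $-$, $+$, and for six matrices the generator produces 15 terms, in agreement with the counting formula.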
The trace formulae 

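$\mathrm{Tr}[\gamma_5] = 0$ (A.33)

$\mathrm{Tr}[\gamma_5\gamma^\mu\gamma^\nu] = 0$ (A.34)

$\mathrm{Tr}[\gamma_5\gamma^\mu\gamma^\nu\gamma^\lambda\gamma^\rho] = -4i\epsilon^{\mu\nu\lambda\rho}$ (A.35)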

have meaning only in 4 dimensions. 

A.2 SU(N) Algebra 

The ($N^2 - 1$) generators $t^a$ of the special unitary group in N dimensions normalized to

$\mathrm{Tr}(t^a t^b) = \frac{1}{2}\,\delta^{ab}$ (A.36)



1 Since the trace of the identity is an overall factor, we have taken it as 4 in d dimensions. 
satisfy the Lie algebra

$[t^a, t^b] = i f^{abc}t^c$ , (A.37)

which defines the totally antisymmetric structure constants $f^{abc}$ of the group. The generators
also satisfy the Fierz identity



$(t^a)^i{}_j\,(t^a)^k{}_l = \frac{1}{2}\left(\delta^i_l\,\delta^k_j - \frac{1}{N}\,\delta^i_j\,\delta^k_l\right)$ . (A.38)



For N = 3, we can use the representation 
(A.39)



(A.40) 



(A.41) 



(A.42) 



in which the nonzero structure constants are

  a b c     $f^{abc}$
  1 2 3     1
  1 4 7     1/2
  1 5 6     -1/2
  2 4 6     1/2
  2 5 7     1/2
  3 4 5     1/2
  3 6 7     -1/2
  4 5 8     $\sqrt{3}/2$
  6 7 8     $\sqrt{3}/2$          (A.43)
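The table can be checked numerically. The sketch below assumes that the N = 3 representation is the standard Gell-Mann one, $t^a = \lambda^a/2$ (the explicit matrices are not legible in this copy, so this is an assumption), and extracts $f^{abc}$ from $[t^a, t^b] = i f^{abc}t^c$ together with the normalization (A.36).

```python
from math import sqrt

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

R3 = 1 / sqrt(3)
# Standard Gell-Mann matrices lambda^1 ... lambda^8 (assumed representation).
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[R3, 0, 0], [0, R3, 0], [0, 0, -2 * R3]],
]
T = [[[entry / 2 for entry in row] for row in lam] for lam in GELL_MANN]

def f(a, b, c):
    """f^{abc} = -2i Tr([t^a, t^b] t^c), with a, b, c running from 1 to 8."""
    ta, tb, tc = T[a - 1], T[b - 1], T[c - 1]
    ab, ba = matmul(ta, tb), matmul(tb, ta)
    commutator = [[x - y for x, y in zip(r1, r2)] for r1, r2 in zip(ab, ba)]
    return (-2j * trace(matmul(commutator, tc))).real

def nonzero_structure_constants(tol=1e-12):
    """All nonzero f^{abc} with a < b < c."""
    return {(a, b, c): f(a, b, c)
            for a in range(1, 9) for b in range(a + 1, 9) for c in range(b + 1, 9)
            if abs(f(a, b, c)) > tol}
```

With this choice of matrices, `nonzero_structure_constants()` returns exactly the nine index triples listed in the table.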
The symmetric structure constants of the group, $d^{abc}$, are defined by

$\{t^a, t^b\} = \frac{1}{N}\,\delta^{ab} + d^{abc}t^c$ . (A.44)

In terms of these constants, one can write

$t^a t^b = \frac{1}{2}\left[\frac{1}{N}\,\delta^{ab} + i f^{abc}t^c + d^{abc}t^c\right]$ . (A.45)

The Jacobi identities,

$[F^a, F^b] = i f^{abc}F^c$ , (A.46)
$[F^a, D^b] = i f^{abc}D^c$ , (A.47)

where $(F^a)_{bc} = -i f^{abc}$ and $(D^a)_{bc} = d^{abc}$, and the relation

$f^{abe}f^{cde} = \frac{2}{N}\left(\delta^{ac}\delta^{bd} - \delta^{ad}\delta^{bc}\right) + \left(d^{ace}d^{bde} - d^{ade}d^{bce}\right)$ (A.48)

follow from repeated use of (A.37) and (A.44). These expressions can be used to show the
validity of

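$t^a t^a = C_F = \frac{N}{2} - \frac{1}{2N}$ (A.49)

$t^b t^a t^b = -\frac{1}{2N}\,t^a$ (A.50)

$\mathrm{Tr}[t^a t^b t^c] = \frac{1}{4}\left(d^{abc} + i f^{abc}\right)$ (A.51)

$\mathrm{Tr}[F^a D^b] = 0$ (A.52)

$\mathrm{Tr}[F^a F^b] = N\delta^{ab} = C_A\,\delta^{ab}$ (A.53)

$\mathrm{Tr}[D^a D^b] = \left(N - \frac{4}{N}\right)\delta^{ab}$ (A.54)

$\mathrm{Tr}[F^a D^b D^c] = i\left(\frac{N}{2} - \frac{2}{N}\right)f^{abc}$ (A.55)

$\mathrm{Tr}[F^a F^b D^c] = \frac{N}{2}\,d^{abc}$ (A.56)

$\mathrm{Tr}[F^a F^b F^c] = i\,\frac{N}{2}\,f^{abc}$ (A.57)

$\mathrm{Tr}[D^a D^b D^c] = \left(\frac{N}{2} - \frac{6}{N}\right)d^{abc}$ . (A.58)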
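Two of these statements can be tested for N = 3 using nothing but the table of structure constants (A.43). The sketch below rebuilds the fully antisymmetric $f^{abc}$ from the nine listed values and checks the Jacobi identity and $\sum_{c,d} f^{acd}f^{bcd} = N\delta^{ab}$, which is $\mathrm{Tr}[F^aF^b]$ for $(F^a)_{bc} = -if^{abc}$. The function names are hypothetical.

```python
from itertools import permutations
from math import sqrt

# Nonzero f^{abc} with a < b < c, as listed in (A.43).
BASE = {(1, 2, 3): 1.0, (1, 4, 7): 0.5, (1, 5, 6): -0.5,
        (2, 4, 6): 0.5, (2, 5, 7): 0.5, (3, 4, 5): 0.5,
        (3, 6, 7): -0.5, (4, 5, 8): sqrt(3) / 2, (6, 7, 8): sqrt(3) / 2}

def permutation_sign(p):
    inversions = sum(1 for i in range(len(p)) for j in range(i + 1, len(p))
                     if p[i] > p[j])
    return -1 if inversions % 2 else 1

# Extend to all index orderings by total antisymmetry.
F = {}
for triple, value in BASE.items():
    for p in permutations(range(3)):
        F[tuple(triple[i] for i in p)] = value * permutation_sign(p)

def f(a, b, c):
    return F.get((a, b, c), 0.0)

COLORS = range(1, 9)

def trace_FF(a, b):
    """Tr[F^a F^b] = sum_{c,d} f^{acd} f^{bcd} for (F^a)_{bc} = -i f^{abc}."""
    return sum(f(a, c, d) * f(b, c, d) for c in COLORS for d in COLORS)

def jacobi_residual(a, b, c, e):
    """f^{abd} f^{dce} + f^{bcd} f^{dae} + f^{cad} f^{dbe}, which should vanish."""
    return sum(f(a, b, d) * f(d, c, e) + f(b, c, d) * f(d, a, e)
               + f(c, a, d) * f(d, b, e) for d in COLORS)
```

For N = 3 the diagonal entries of `trace_FF` equal 3, consistent with $C_A = 3$.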
A.3 Covariant integration

The standard integrals one encounters in a dimensionally regularized quantum field theory
with covariant gauge-fixing are of the form

r 1 <i d h Np i 

/ n Nm > {mk}) n » (A.59) 
j i=i y ' i=i 1 » 

where $\{l_i\}$ are the loop momenta, $\{k_i\}$ the propagator momenta, $\{m_i\}$ the relevant masses, and
$\{p_i\}$ external momenta. $N$ is a definite function of the loop momenta (or, if one prefers, the
propagator momenta), the external momenta, and the masses. The propagator momenta are
necessarily linear in the loop momenta and can be chosen to have coefficient 1, so the technique
of Feynman parameters (along with the use of Lorentz symmetry) can always reduce this
integral to the form of (E.23) at the expense of introducing $N_p$ new integration parameters.



His technique is based on the identity

$\frac{1}{AB} = \int_0^1 dx\,dy\;\delta(1-x-y)\,\frac{1}{[xA+yB]^2}$ . (A.60)

Differentiating with respect to B gives

$\frac{1}{AB^2} = \int_0^1 dx\,dy\;\delta(1-x-y)\,\frac{2y}{[xA+yB]^3}$ , (A.61)

which immediately allows one to derive

$\frac{1}{ABC} = \int_0^1 dw\,dx\,dy\,dz\;\delta(1-x-y)\,\delta(1-z-w)\,\frac{2w}{[wxA+wyB+zC]^3}$

$\quad = \int_0^1 dx\,dy\,dz\;\delta(1-x-y-z)\,\frac{2}{[xA+yB+zC]^3}$ . (A.62)
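The two-denominator identity (A.60) and its derivative (A.61) are easy to test numerically. The following Python sketch uses a simple midpoint rule after the $\delta$-function has been used to set $y = 1 - x$; it assumes positive real $A$ and $B$, and the function names are illustrative.

```python
def integrate_unit_interval(func, steps=20000):
    """Midpoint rule on [0, 1]."""
    h = 1.0 / steps
    return h * sum(func((i + 0.5) * h) for i in range(steps))

def feynman_two_denominators(A, B):
    """Right-hand side of (A.60) with y = 1 - x; should equal 1/(A B)."""
    return integrate_unit_interval(lambda x: 1.0 / (x * A + (1 - x) * B) ** 2)

def feynman_squared_denominator(A, B):
    """Right-hand side of (A.61) with y = 1 - x; should equal 1/(A B^2)."""
    return integrate_unit_interval(
        lambda x: 2 * (1 - x) / (x * A + (1 - x) * B) ** 3)
```

For example, `feynman_two_denominators(2.0, 3.0)` should agree with 1/6 up to the quadrature error of the midpoint rule.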

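This process continues; it is easy to show by induction that the formulae

$\frac{1}{A_1\cdots A_n} = \int_0^1 dx_1\cdots dx_n\;\delta\!\left(\sum x_i - 1\right)\frac{\Gamma(n)}{\left[\sum x_iA_i\right]^n}$ (A.63)

$\frac{1}{A_1^{m_1}\cdots A_n^{m_n}} = \int_0^1 dx_1\cdots dx_n\;\delta\!\left(\sum x_i - 1\right)\frac{\prod x_i^{m_i-1}}{\left[\sum x_iA_i\right]^{\sum m_i}}\,\frac{\Gamma\!\left(\sum m_i\right)}{\prod\Gamma(m_i)}$ (A.64)

hold for integer $m_i$. Furthermore, the right hand side is an analytic function of $m_i$ that can
be continued throughout the complex plane. I find the decoupled formulae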



A x ---A 



n+l Jo 



1 _f\. ' ' : " • 111 



dxi ■ ■ ■ dx n 



x ■ 



E^i-^rrS 



71+ 1 



(A.65) 
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1 



A™ 1 • ■ • A 



n+1 



[n^rmfa-^rr 1 . 



x i) n x o 



r(E mp 



(A.66) 



E^(i 

(where I have defined $x_{n+1} = 0$ and $\prod^0 = 1$) much more useful in practical calculations.

Once one has combined all of the denominators in a Feynman integral, one can shift the
integration variable to get rid of all of the directional dependence of the resulting 'monster
denominator' via[2]

-1 71+ 1 



l 2 (l- Pl f(l-p2f---(l-PnY 



(5Z x iPi) 



(A.67) 



Any directional dependence in the numerator is illusory, as one can always remove external
vectors from the integral, leaving a free index,[3] so we are left with a directionally independent
integral. Using the symmetry of the problem, one can now eliminate all free indices via

$\int\frac{d^dl}{(2\pi)^d}\,\frac{l^\mu\,(l^2)^\alpha}{(l^2+M^2)^\beta} = 0$ (A.68)

$\int\frac{d^dl}{(2\pi)^d}\,\frac{l^\mu l^\nu\,(l^2)^\alpha}{(l^2+M^2)^\beta} = \frac{g^{\mu\nu}}{d}\int\frac{d^dl}{(2\pi)^d}\,\frac{(l^2)^{\alpha+1}}{(l^2+M^2)^\beta}$ (A.69)

or the higher-rank equivalent. The remaining d-dimensional integral is evaluated via

$\int\frac{d^dl}{(2\pi)^d}\,\frac{(l^2)^\alpha}{(l^2+M^2)^\beta} = \frac{1}{(4\pi)^{d/2}}\,(M^2)^{d/2+\alpha-\beta}\,\frac{\Gamma(d/2+\alpha)\,\Gamma(\beta-\alpha-d/2)}{\Gamma(d/2)\,\Gamma(\beta)}$ . (A.70)

Since this always gives a power of M 2 , which is invariably quadratic in the loop momenta, this 
procedure continues until all of the d-dimensional integrals have been taken. One is then left 
only with one-dimensional integrals, which are in principle straightforward. 
The following formulae are useful in this context:

$\Gamma(z)\,\Gamma(1-z) = \frac{\pi}{\sin\pi z}$ (A.71)

$\Gamma\!\left(\frac{z+1}{2}\right) = 2^{-z}\sqrt{\pi}\;\frac{\Gamma(z+1)}{\Gamma(1+z/2)}$ (A.72)


This expression assumes the use of the coupled parameter formula (A. 63) and ignores masses. The sums 



run from 1 to n, with determined by the 5-function in (A. 63). 

3 In multi-loop calculations, one may find one's self with fractional powers in the numerator. These powers 
are to be considered part of the denominator and combined via the above approach. One must be careful in 
these situations since fractional powers can easily lead to ambiguities in the complex plane. To treat these 
integrals consistently, one always performs Wick rotations before combining denominators and integrating. In 
this way, there will always be an integer number of signs. 
$\Gamma(2z) = 2^{2z-1}\,\pi^{-1/2}\,\Gamma(z)\,\Gamma(z+1/2)$

$\log[\Gamma(1+\epsilon)] = -\gamma_E\,\epsilon + \sum_{n=2}^{\infty}\frac{(-1)^n}{n}\,\zeta(n)\,\epsilon^n$

$\int_0^1 dx\,x^{\alpha-1}(1-x)^{\beta-1} = \frac{\Gamma(\alpha)\,\Gamma(\beta)}{\Gamma(\alpha+\beta)}$

For more complicated calculations, 

/ dxdyx a - l {l - scy-V -1 ^ - y) Q_7_1 (l - <By)" 



r(/?)r( 7 )r(q + /3 + M - 7 ) 

r(a + /3 + it)Y{a + (3 - 7) 



may also be of use. In addition, the integrals 
d d k 1 



(2ir) d (fc 2 )«[(fc-( ? ) 2 ]/ 3 

_ (- q 2)d/2- a -fJ r ( rf /2 - /3)r(rf/2 - a)T(a + (3- d/2) 



(47r) d / 2 



r(«)r(/3)r(d - a - p) 



(A.73) 
(A.74) 

(A.75) 



(A.76) 



(A.77) 



d d k 



(27r) d (fc 2 )«[(fc- g ) 2 ]' 3 

( _ g 2 )d /2-q- j3 r(d/2 _ md/2 ± 1 _ Q)r(a + g _ d/2) 

(4^) d / 2 r(a)r(fl)r(d + 1 - a - 8) q ' 1 ' 



d d k 



k^k h 



{2ir) d {k 2 ) a [(k - q) 2 Y 

(_ q 2y/2- a -0 p( d / 2 - 3)T(d/2 + 1 - a)T(a + (3- d/2) 



( 47r )<i/2 



T(a)T(p)T(d + 2 - a - j3) 
- J- 1 - a ) 



2 a+8-d/2-l q y ' \2 



(A.79) 



k»k v k x 



{2-kY {k 2 ) a [(k - q) 2 Y 

_ (_ g 2)d/2-a-/3 p( d / 2 - /3)r(rf/2 + 2 - a)r(a + /3 - d/2) 



( 47r )<i/2 



r( a )r(/?)r(d + 3 - a - p) 

d/2- 13 



1 



2 a + (3-1- d/2 



d 



q 2 (g^q x + g vX q» + g^q") (A.80) 



+ ^ + 2 - a) «"«V 

can be done in general and have simple forms. These integrals have already been Wick rotated 
to Euclidean space. Loop momenta with free indices have been left alone, so one must be 
careful when using these formulae with internal contractions. 
In all the above formulae, $\Gamma(n)$ is the $\Gamma$-function, defined via

$\Gamma(n) = \int_0^\infty dx\,x^{n-1}e^{-x}$ (A.81)

for $\Re e\,n > 0$ and either the reflection identity (A.71) or the recursion relation

$n\,\Gamma(n) = \Gamma(n+1)$ (A.82)

elsewhere. It is a meromorphic function of n throughout the complex plane with simple poles
at the negative integers and zero, with residues

$\mathrm{Res}[\Gamma(z), -n] = \frac{(-1)^n}{n!}$ . (A.83)

Note that

$\Gamma(n+1) = n!$ (A.84)
$\Gamma(1/2) = \sqrt{\pi}$ . (A.85)

Its expansion involves the Euler-Mascheroni constant

$\gamma_E = \lim_{n\to\infty}\left(\sum_{k=1}^{n}\frac{1}{k} - \log n\right) = \sum_{n=2}^{\infty}\frac{(-1)^n}{n}\,\zeta(n) \sim 0.577\,215\,664\,901\,532\,861$ (A.86)

and the Riemann zeta function,

$\zeta(s) = \sum_{n=1}^{\infty}\frac{1}{n^s}$ (A.87)

for $\Re e\,s > 1$. For other values of s in the complex plane, it is defined via

$\zeta(s) = 2^s\,\pi^{s-1}\,\sin\frac{\pi s}{2}\;\Gamma(1-s)\,\zeta(1-s)$ (A.88)

and found to be a meromorphic function with a simple pole at s = 1 with residue

$\mathrm{Res}[\zeta(s), 1] = 1$ . (A.89)

Values of $\zeta(s)$ at positive even integer s are known via the expansion

$\frac{x}{e^x - 1} = 1 - \frac{x}{2} - 2\sum_{n=1}^{\infty}(-1)^n\,\zeta(2n)\left(\frac{x}{2\pi}\right)^{2n}$ . (A.90)

In particular,

$\zeta(2) = \frac{\pi^2}{6}$ (A.91)
$\zeta(4) = \frac{\pi^4}{90}$ . (A.92)
Using these results in conjunction with (A.88), one can derive

$\zeta(0) = -\frac{1}{2}$ (A.93)
$\zeta(-1) = -\frac{1}{12}$ (A.94)
$\zeta(-3) = \frac{1}{120}$ . (A.95)

In addition, consistency between (A.83), (A.87), and (A.88) implies

$\zeta(-2n+\epsilon) \sim \epsilon\,\frac{(-1)^n\,(2n)!}{2\,(2\pi)^{2n}}\,\zeta(2n+1)$ (A.96)

for positive integer n.

For completeness, I also include the standard transcendentals,

$\pi \sim 3.141\,592\,653\,589\,793\,238$ (A.97)
$e \sim 2.718\,281\,828\,459\,045\,235$ , (A.98)

that appear in our relations.
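The quoted $\zeta$ values can be reproduced with a short numerical sketch. It assumes the standard series for $\Re e\,s > 1$, the standard functional equation $\zeta(s) = 2^s\pi^{s-1}\sin(\pi s/2)\,\Gamma(1-s)\,\zeta(1-s)$ for negative arguments, and the usual harmonic-sum limit for the Euler-Mascheroni constant. The tail estimate and the $1/(2n)$ correction are additions of mine that only speed up convergence.

```python
import math

def zeta_sum(s, terms=100000):
    """Direct sum for s > 1, plus an integral estimate of the neglected tail."""
    partial = sum(k ** -s for k in range(1, terms + 1))
    return partial + (terms + 0.5) ** (1 - s) / (s - 1)

def zeta_reflected(s):
    """zeta(s) for s < 0 obtained from the functional equation."""
    return (2 ** s * math.pi ** (s - 1) * math.sin(math.pi * s / 2)
            * math.gamma(1 - s) * zeta_sum(1 - s))

def euler_mascheroni(n=100000):
    """Harmonic sum minus log n, with the leading 1/(2n) correction removed."""
    harmonic = sum(1.0 / k for k in range(1, n + 1))
    return harmonic - math.log(n) - 1.0 / (2 * n)
```

Evaluating `zeta_reflected(-1)` and `zeta_reflected(-3)` gives values close to $-1/12$ and $1/120$, and `euler_mascheroni()` approaches 0.5772.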



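A.4 QCD and Feynman's Rules of Perturbation Theory

The lagrangian density of massless QCD is written

(A.99)

where $\psi$ are the quark fields and $A$ are the gluon fields. The covariant derivative $\mathcal{D}$ is given
by

$\overrightarrow{\mathcal{D}} = \overrightarrow{\partial} + igA$ , (A.100)
$\overleftarrow{\mathcal{D}} = \overleftarrow{\partial} - igA$ , (A.101)
$\mathcal{D}^{ab} = \delta^{ab}\partial + g f^{abc}A^c$ , (A.102)

and the gluon field strength, $\mathcal{F}$, is

$\mathcal{F}^{\mu\nu}_a = -\frac{i}{g}\,[\mathcal{D}^\mu, \mathcal{D}^\nu]_a = \partial^\mu A^\nu_a - \partial^\nu A^\mu_a - g f^{abc}A^\mu_b A^\nu_c$ . (A.103)

The Feynman rules for covariant gauge-fixing are shown in Figure A.1. For the use of these
rules, see Section 1.3.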
[Figure A.1 diagrams, with their associated factors: quark propagator $i/(\slashed{p} - m)$; gluon propagator in covariant gauge; ghost propagator; quark-gluon vertex $-igt^a$; ghost-gluon, three-gluon and four-gluon vertices built from $f^{abc}$ and the metric tensor.]

Figure A.1: Feynman's rules for a non-abelian gauge theory with covariant gauge-fixing. The
straight line represents quark propagation, while the squiggly line represents gluons and the
dashed ghosts. In order to avoid index pollution, most of the indices here have been suppressed.
The proper pole-prescription is given by adding $i\epsilon$ to each denominator. Along with each
vertex, a $\delta$-function ensuring energy-momentum conservation is implied.
This lagrangian leads to the equations of motion

$\slashed{\mathcal{D}}\psi = 0$ (A.104)

$\bar\psi\overleftarrow{\slashed{\mathcal{D}}} = 0$ (A.105)

$\mathcal{D}_\mu\mathcal{F}^{\mu\nu}_a = g\bar\psi t^a\gamma^\nu\psi$ (A.106)

and the energy-momentum tensor density^ 

- ^Hv "ty - T^K a + \g^K P Kp , (A.108) 

where D= ± - £H . 

The coupling of QCD enjoys the /3-function 



fda01 ^ ^ K(/x 2 ))= _^_ 6oas(/i2)2 __l_ 6;!n : (A . |()!)j 



$b_0 = \frac{11}{3}\,C_A - \frac{4}{3}\,n_f T_F$ (A.110)

$b_1 = \frac{2}{3}\left[17C_A^2 - 2n_f T_F\,(3C_F + 5C_A)\right]$ , (A.111)

where $\alpha_s = g^2/4\pi$, $C_A = 3$, $C_F = 4/3$, $T_F = 1/2$, and $n_f$ is the number of active (light) quark
flavors.
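For orientation, the color factors quoted here can be inserted into the coefficients with exact rational arithmetic. The sketch assumes the conventional normalization $b_0 = \frac{11}{3}C_A - \frac{4}{3}n_fT_F$ and $b_1 = \frac{2}{3}[17C_A^2 - 2n_fT_F(3C_F + 5C_A)]$; the numerical prefactors are not fully legible in this copy, so they are an assumption rather than a quotation.

```python
from fractions import Fraction

C_A = Fraction(3)
C_F = Fraction(4, 3)
T_F = Fraction(1, 2)

def b0(n_f):
    """One-loop coefficient, assuming the 11/3 and 4/3 prefactors."""
    return Fraction(11, 3) * C_A - Fraction(4, 3) * n_f * T_F

def b1(n_f):
    """Two-loop coefficient, assuming the overall prefactor 2/3."""
    return Fraction(2, 3) * (17 * C_A ** 2 - 2 * n_f * T_F * (3 * C_F + 5 * C_A))

def max_flavors_with_positive_b0(limit=30):
    """Largest n_f for which b0 stays positive (asymptotic freedom)."""
    return max(n for n in range(limit) if b0(n) > 0)
```

With three light flavors this gives $b_0 = 9$ and $b_1 = 64$, and $b_0$ remains positive up to sixteen flavors.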

The fundamental Ward identity of QCD can be written 

g d, (Tft(x) 0(x u x n )) = -id, (t S ° {X s ^ Xn) ) , (A.H2) 

where 

$j^\mu_a = \bar\psi\gamma^\mu t^a\psi + f^{abc}\mathcal{F}^{\mu\alpha}_b A^c_\alpha$ (A.113)

is the chromodynamic current. This identity is valid for any gauge-invariant operator O,
though in covariant gauges one must add a ghost contribution to $j^\mu_a$.

To compare fields that are separated in spacetime, it is necessary to introduce the gauge 
link 

g c ( Xl y)=V e - lg 5 1 a A » (z(T))(dZ " /dT)dT , (A.114) 

4 This expression is also valid for massive quarks, in which case I must specify that the symmetrization of
indices in the first term does not indicate a removal of the trace; the trace of $\Theta^{\mu\nu}$ is given by

6% = m^V> + v^T • (A.107) 
which transforms as

$\mathcal{G}_C(x,y) \to U(x)\,\mathcal{G}_C(x,y)\,U^\dagger(y)$

under a gauge transformation.

The one-loop splitting functions for unpolarized 

Pns(x) = C F l -^ = P s {x) 
1 — X 

P sg (x) = Tp [ x 2 + {l-x) 2 ] 
l + {\-x) 2 



(A.115) 



P 9 (x) 



C F 
2C A 



1-x 



1 -x 



H h£c(l-a;) 



and polarized 



Pns{x) = P s (x)=P NS (x) 

P sg (x) = T F [x 2 -(l-x) 2 ] 

Pg s {X) = C F - 

P g {x) = 2C A 



x 

l-2x 



1-x 



parton densities, coupled with the endpoint contributions for quarks 



and gluons 



o x" x-y 



Aa= *n f T F -llC A+2CA r dy 



6 Jo x-y 

generate the DGLAP evolution equation in the nonsinglet 

a s (/i 2 ) f 1 dy ( x s 



and singlet 



D„ 



^J^fg/Tix^ 2 ) 



a s (n 2 ) f 1 dy 



(A.116) 
(A.117) 
(A.118) 
(A.119) 

(A.120) 
(A.121) 
(A.122) 

(A.123) 

(A.124) 
(A.125) 



^^rjNS/Tix^ 2 ) = ^ jf ^-PNS (~) f NS /T{y^ 2 ) (A.126) 



2tt J x y 
<* s (/x 2 ) f 1 dy 



p s (?) f s/T (y^ 2 ) + 2n f p sg (?) f g/T (y^ 2 ) 
p 9 s (f) fs/Av, m 2 ) + p s (?) f g/T (y, v 2 ) 



(A.127) 
(A.128) 



2tt J x y 

sectors. The polarized evolution equations are obtained simply by adorning these with tildes.[5]

5 As long as one is careful with the definitions of antiquark distributions. See Section \>. \ . 
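A.5 Other Useful Relations

Here, I present some miscellaneous relations which prove useful in the context of quantum field
theory:

$[AB, CD] = A[B,C]D + [A,C]BD + C[A,D]B + CA[B,D]$ (A.129)

$\quad = A\{B,C\}D - \{A,C\}BD - C\{A,D\}B + CA\{B,D\}$ (A.130)

$\epsilon^{\mu\nu\alpha\beta}\epsilon_{\mu\nu\alpha\beta} = -24$ (A.131)

$\epsilon^{\mu\nu\alpha\beta}\epsilon_{\lambda\nu\alpha\beta} = -6\,\delta^\mu_\lambda$ (A.132)

$\epsilon^{\mu\nu\alpha\beta}\epsilon_{\lambda\rho\alpha\beta} = -2\left(\delta^\mu_\lambda\delta^\nu_\rho - \delta^\mu_\rho\delta^\nu_\lambda\right)$ (A.133)

$\epsilon^{\mu\nu\alpha\beta}\epsilon_{\lambda\rho\sigma\beta} = -\left(\delta^\mu_\lambda\delta^\nu_\rho\delta^\alpha_\sigma - \delta^\mu_\lambda\delta^\nu_\sigma\delta^\alpha_\rho + \delta^\mu_\rho\delta^\nu_\sigma\delta^\alpha_\lambda - \delta^\mu_\rho\delta^\nu_\lambda\delta^\alpha_\sigma + \delta^\mu_\sigma\delta^\nu_\lambda\delta^\alpha_\rho - \delta^\mu_\sigma\delta^\nu_\rho\delta^\alpha_\lambda\right)$ (A.134)

$\epsilon^{\mu\nu\alpha\beta}\epsilon_{\lambda\rho\sigma\tau} = -\left(\delta^\mu_\lambda\delta^\nu_\rho\delta^\alpha_\sigma\delta^\beta_\tau + \text{23 permutations}\right)$ (A.135)

$g_{\mu\nu}\epsilon_{\lambda\rho\sigma\tau} = g_{\mu\lambda}\epsilon_{\nu\rho\sigma\tau} + g_{\mu\rho}\epsilon_{\lambda\nu\sigma\tau} + g_{\mu\sigma}\epsilon_{\lambda\rho\nu\tau} + g_{\mu\tau}\epsilon_{\lambda\rho\sigma\nu}$ (A.136)

$\frac{1}{(x+i\epsilon)^n} = P\frac{1}{x^n} + (-1)^n\,\frac{i\pi}{\Gamma(n)}\left(\frac{d}{dx}\right)^{n-1}\delta(x)$ (A.137)

$\int dx\,e^{ikx} = 2\pi\,\delta(k)$ (A.138)

and generally:

$\sum_{i=1}^{n} 1 = n$ (A.139)

$\sum_{i=1}^{n} i = \frac{n(n+1)}{2}$ (A.140)

$\sum_{i=1}^{n} i^2 = \frac{n(n+1)(2n+1)}{6}$ (A.141)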



1=1 



Vz fc = fcllimiLf— , i (A.143) 

^ t^o dt k e* - 1 v ; 

£(-l) n-i = (A.144) 



2 

E( _ ir - (i _ i» + i)_i + <_i)_ (AJ45) 
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l l 2 



1=1 

n 

5>i)-^ 3 

i=l 

n 

n 



i=0 



i=0 



x-y 



1 



1-z 



n 
z 

n — 1\ — 1 

i - 1 



E 

i=0 



n + i 
i 



E 

E(-d 



i=j v 7 vy 



j=0 



n(n + 1) 
2 

(n + l) 2 (2n-l) | ! + (-!)" 



= lim 



d k e (n+i)t + ^_ 1 y 



t^o dt k e* + 1 
1 - x n+1 



l-x 



(A.146) 
(A.147) 
(A.148) 
(A.149) 



l-x 

log(l - a;) = 

log(l -x) = 

log 2 (l-x) = 



n-Z-l 

6(n-2Z-j-l) £ 

i+j-i 

-Q(2l + j-n-l) J2 

i—n — l 

oo 

E*< 

OO ^ 

^ — X 

i— 1 

oo i 1 

-E*'Ej 
»=i j=i j 

oo i-1 



{z + vT = E 



i=0 



77.! 



z!(n — i)! 
i. 



= 2™ 



71+1 

.7 + 1 

n + j + 1 
3 

n )(x + y y 



= 5, 



n0 



(A.150) 
(A.151) 
(A.152) 

(A.153) 
(A.154) 

(A.155) 

(A.156) 
(A.157) 
(A.158) 

(A.159) 

(A.160) 
(A.161) 
(A.162) 
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i=0 V*/ 



<-nV 

( _ ir ^+i 

n+ 1 
n + 2 



(A.163) 
(A.164) 
(A.165) 
(A.166) 
(A.167) 
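Several of the relations collected in this section lend themselves to brute-force verification. The following Python sketch checks the contractions of two Levi-Civita symbols, assuming $\epsilon^{0123} = +1$ and a diagonal metric of signature -2 for lowering indices; the function names are hypothetical.

```python
from itertools import permutations, product

METRIC = (1, -1, -1, -1)

def permutation_sign(p):
    inversions = sum(1 for i in range(len(p)) for j in range(i + 1, len(p))
                     if p[i] > p[j])
    return -1 if inversions % 2 else 1

# epsilon^{mu nu alpha beta} with epsilon^{0123} = +1; absent keys mean zero.
EPS_UP = {p: permutation_sign(p) for p in permutations(range(4))}

def eps_up(*idx):
    return EPS_UP.get(idx, 0)

def eps_down(*idx):
    """Lower all four indices with the diagonal metric."""
    factor = 1
    for i in idx:
        factor *= METRIC[i]
    return factor * eps_up(*idx)

def full_contraction():
    return sum(eps_up(*i) * eps_down(*i) for i in product(range(4), repeat=4))

def contract_three(mu, lam):
    return sum(eps_up(mu, n, a, b) * eps_down(lam, n, a, b)
               for n, a, b in product(range(4), repeat=3))

def contract_two(mu, nu, lam, rho):
    return sum(eps_up(mu, nu, a, b) * eps_down(lam, rho, a, b)
               for a, b in product(range(4), repeat=2))
```

The overall minus signs in these contractions come from the single time-like entry of the metric, which makes $\epsilon_{0123} = -1$.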
Appendix B

The Lorentz Group and its Representations

In this appendix, we will discuss some of the more mathematical properties of the spacetime
translation and rotation group. First, I define some terms.
A group is a collection of objects which satisfy the following properties:

1) Any two elements, x, y, in the group may be composed with each other to form another
element of the group, xy or yx. This operation is called multiplication.

2) Group multiplication must be associative, so that $\forall\,x, y, z \in \mathcal{G}$

$x(yz) = (xy)z$ . (B.1)

3) $\mathcal{G}$ also must possess an identity, 1, such that $\forall\,x \in \mathcal{G}$

$x1 = 1x = x$ . (B.2)

4) Each element of the group must have an inverse, $x^{-1}$, associated with it such that

$xx^{-1} = x^{-1}x = 1$ . (B.3)

Group multiplication does not have to be commutative. In fact, it usually isn't. A group 
whose multiplication operation is commutative is called abelian. 

A representation of a group is a collection of objects, one for each element of the group,
which satisfy the group multiplication law. It is called a representation because it represents
the abstract group in some concrete way. For example, the matrices

(B.4)

with ordinary matrix multiplication, represent the group $Z_2$. Here, $2 = 2^{-1}$. The objects
{1, 0}, with the multiplication operation identified as addition and 1 + 1 defined to be 0, also
form a representation of $Z_2$. In this representation, 1 = 0.
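The group axioms and the two representations of $Z_2$ can be checked mechanically. Since the matrices of (B.4) are not legible in this copy, the sketch below assumes the 2x2 identity and the matrix that swaps the two components; that choice, like the function names, is hypothetical.

```python
from itertools import product

IDENTITY = ((1, 0), (0, 1))
SWAP = ((0, 1), (1, 0))      # assumed non-identity element; it is its own inverse

def matmul2(a, b):
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def add_mod2(x, y):
    """The {1, 0} representation: addition with 1 + 1 defined to be 0."""
    return (x + y) % 2

def is_group(elements, op, identity):
    """Check closure, associativity, identity and inverses by exhaustion."""
    elements = list(elements)
    closed = all(op(x, y) in elements for x, y in product(elements, repeat=2))
    associative = all(op(x, op(y, z)) == op(op(x, y), z)
                      for x, y, z in product(elements, repeat=3))
    has_identity = all(op(x, identity) == x == op(identity, x) for x in elements)
    has_inverses = all(any(op(x, y) == identity == op(y, x) for y in elements)
                       for x in elements)
    return closed and associative and has_identity and has_inverses

# Map between the two representations; 0 plays the role of the identity.
TO_MATRIX = {0: IDENTITY, 1: SWAP}

def same_multiplication_law():
    return all(TO_MATRIX[add_mod2(x, y)] == matmul2(TO_MATRIX[x], TO_MATRIX[y])
               for x, y in product((0, 1), repeat=2))
```

Both sets pass `is_group`, and `same_multiplication_law()` confirms that they realize the same abstract multiplication table.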

A representation R is called irreducible if for every $x, y \in R$ there exists an element
$z \in R$ such that $zy = x$. For example, complex 2-vectors with unit norm form an irreducible
representation of SU(2). Choosing a complex 2-vector with unit norm at random to associate
with the identity, say (1,0), we see that any other vector in the representation can be obtained
via an SU(2) transformation:

$\begin{pmatrix} \alpha \\ \beta \end{pmatrix} = \begin{pmatrix} \alpha & -\beta^* \\ \beta & \alpha^* \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ . (B.5)

Associating the vector $(\alpha, \beta)$ with the transformation necessary here gives us the representation.
On the other hand, the representation of SU(2) formed by unitary hermitian 2x2
matrices is not irreducible. Indeed, performing unitary SU(2) transformations on the unitary
hermitian matrix

$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ (B.6)

will certainly not get us to any other element of our representation. This element forms
a representation all by itself. However, once we remove it, we are left with an irreducible
representation which any of the three Pauli matrices will be happy to span for us.

This appendix is concerned with the irreducible representations of the spacetime symmetry
group, SO(3, 1). We will find it to be a continuous group that relates different inertial frames
of reference to each other. Since it is a fundamental symmetry of nature, physical particles
and states will form representations of it. This is the foothold which we use to formalize a
theory of objects that we cannot directly interact with (on their level), and whose interactions
do not always follow our intuition.

Much of the analysis of Section B.2 is based on that found in S. Weinberg [ft8| .

B.1 Relativity

The contemporary understanding of special relativity is based on the assumption that the
speed of light in a vacuum, c, is the same in all reference frames. To see what this implies, let
us suppose that a beam of light travels in vacuum from spacetime point 0 to spacetime point
P. The coordinates of P can be written in a compact notation as $x^\mu$, where $\mu$ runs from 0 to
3. $x^0$ is the time coordinate and $x^i$, $i = 1, 2, 3$, are the spatial components of this '4-vector'.
Since light goes from 0 to P in vacuo,

$c^2(x^0)^2 = (x^1)^2 + (x^2)^2 + (x^3)^2$ , (B.7)

or[1]

$x^2 \equiv c^2(x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2 = 0$ . (B.8)

In another reference frame whose origin coincides with this one, the final destination of the
light beam will be P'. However, since c is the same in all reference frames, the new coordinates
must also satisfy (B.8). Hence the quantity $x^2$ will be zero either in all reference frames or
none.

Let us study an example where $x^2 \neq 0$. We consider a laser on a train pointed at a mirror
on the ceiling. The train has height h and is moving at a speed v with respect to the ground
in the 1-direction. Taking the origin to be the time and place the laser beam is turned on, we
study the final destination of the 'front' of the laser beam in two different frames. Observer A
seated in the train measures the final destination $x_A^\mu = (2h/c, 0, 0, 0)$.[2] Observer B standing
on the ground measures $x_B^\mu = (2h/c)\,\gamma\,(1, v, 0, 0)$, where $\gamma^2 = 1/(1 - v^2/c^2)$. Hence $x_A^\mu \neq x_B^\mu$,
but $x_A^2 = x_B^2$. It can be shown that this is the case for any process in special relativity. The
assumption that the speed of light is the same in all reference frames then implies that the
line element

$ds^2 = -g_{\mu\nu}\,dx^\mu dx^\nu$ , (B.9)

where

$g_{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$ , (B.10)

1 The component $x^2$ and the 4-vector square $x^2 = c^2(x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2$ are obviously quite
different. However, since the 2-component of a vector is seldom singled out, confusion is rare.

2 Here, as always, component listing is read $x^\mu = (x^0, x^1, x^2, x^3)$.
is invariant under all changes of frame. In this expression we have implied summation over $\mu$
and $\nu$. Any time an index is repeated summation over that index is implied unless explicitly
stated otherwise.[3] For simplicity, we have chosen units in which[4]

$c = 1$ . (B.11)

With these conventions, we may write $x^2 = g_{\mu\nu}x^\mu x^\nu$. The matrix $g_{\mu\nu}$ is called the metric
tensor. It defines the geometry of the spacetime we work on. The specific form (B.10) is called
the Minkowski metric and is relevant for the flat spacetime considered in this work.

Physical constraints require our changes in frame to be linear,[5] so we can write a general
transformation as

$x'^\mu = \Lambda^\mu{}_\nu\,x^\nu$ . (B.12)

$\Lambda$ is our transformation matrix, known in flat space as a Lorentz transform.[6] Invariance of the
line element implies

$g_{\alpha\beta} = \Lambda^\mu{}_\alpha\,\Lambda^\nu{}_\beta\,g_{\mu\nu}$ . (B.13)

Since the metric behaves in this way, we see that the product of any two vectors x and p,
$x\cdot p = g_{\mu\nu}x^\mu p^\nu$, is also invariant under changes of frame.[7]
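The train example and the invariance of $x\cdot p$ can be illustrated with a boost along the 1-direction. The sketch below works in units with c = 1 and assumes the standard form of the boost matrix; the function names are illustrative.

```python
import math

METRIC = (1.0, -1.0, -1.0, -1.0)      # diagonal of g_{mu nu}

def dot(x, p):
    """x . p = g_{mu nu} x^mu p^nu."""
    return sum(g * a * b for g, a, b in zip(METRIC, x, p))

def boost_x(v):
    """Lorentz boost along the 1-direction with velocity v (c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return [[gamma, gamma * v, 0.0, 0.0],
            [gamma * v, gamma, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]

def apply(transform, x):
    """x'^mu = Lambda^mu_nu x^nu."""
    return [sum(transform[m][n] * x[n] for n in range(4)) for m in range(4)]

def preserves_metric(transform, tol=1e-12):
    """Check g_{alpha beta} = Lambda^mu_alpha Lambda^nu_beta g_{mu nu}."""
    for a in range(4):
        for b in range(4):
            total = sum(METRIC[m] * transform[m][a] * transform[m][b]
                        for m in range(4))
            expected = METRIC[a] if a == b else 0.0
            if abs(total - expected) > tol:
                return False
    return True
```

Taking h = 1 and v = 0.6, observer A's event (2, 0, 0, 0) is mapped to approximately (2.5, 1.5, 0, 0) for observer B, and both have the same square, 4.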



We can think of Eq.(B.12) as the way an upper index transforms. Defining $x_\mu = g_{\mu\nu}x^\nu$,
we see that a vector with a lowered index transforms like $x'_\mu = g_{\mu\alpha}\Lambda^\alpha{}_\nu\,x^\nu$. Defining $g^{\mu\nu}$ such
that $g^{\mu\alpha}g_{\alpha\nu} = \delta^\mu_\nu$, with $\delta^\mu_\nu = 0$ if $\mu \neq \nu$ and $= 1$ if $\mu = \nu$, we can put this transformation into
the form

$x_\mu \to x'_\mu = (\Lambda^{-1})^\nu{}_\mu\,x_\nu$ . (B.14)

3 This is known as Einstein's 'summation convention'. Note that in order to sum properly, one index must 
be 'up' and the other 'down'. We will see why this is the case below. 

4 This means if you ask how far away something is, I simply tell you how long it takes light to get there. 

5 If you don't believe me, imagine how a walk to the grocery store then to the post office looks in two
reference frames. The grocery store has the two coordinates $G_A$ and $G_B$ while the post office has the two
coordinates $P_A$ and $P_B$. The vector from the grocery store to the post office is by definition $P_A - G_A$ in
the first frame and $P_B - G_B$ in the second. If the transformation were nonlinear, then this vector would not
transform into its image in the other frame.

6 By convention, the first index of $\Lambda$ is 'up' while the second index is 'down'. It really should be written
$\Lambda^\mu{}_\nu$. Indices on matrices are always understood to be this way, unless otherwise emphasized.

7 Now we see why the notation $x^2$ was used for the strange object in Eq.(B.8): $g_{\mu\nu}x^\mu x^\nu = x\cdot x = x^2$.
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It is now obvious why g^x^p 1 ' = x^p^ is invariant under transformations - one transforms 
like A and the other like A -1 . 

Normal vectors, like position and momentum, naturally have an upstairs index. These are 
called contravariant vectors since they transform in a different way than the axes (rotating the 
axes one way produces the same effect as rotating vectors the opposite way). Vectors which 
naturally have a downstairs index, like the derivative, are called covariant vectors since they 
transform in the same way as the axes. To see this, let us examine the derivative operator : 

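$$\frac{\partial}{\partial x^\mu} f(x) \to \frac{\partial}{\partial x'^\mu} f(x) = \frac{\partial}{\partial x'^\mu} f(\Lambda^{-1} x') = (\Lambda^{-1})^\nu{}_\mu \frac{\partial}{\partial x^\nu} f(x) . \tag{B.15}$$

Hence, $\partial/\partial x^\mu \equiv \partial_\mu$ does indeed transform opposite to $x^\mu$ (or in the same way as the axes).$^{8}$
Tensors behave in exactly the same way. A tensor with $n$ contravariant and $m$ covariant
indices transforms like

$$T^{\mu_1 \cdots \mu_n}{}_{\nu_1 \cdots \nu_m} \to \Lambda^{\mu_1}{}_{\alpha_1} \cdots \Lambda^{\mu_n}{}_{\alpha_n} (\Lambda^{-1})^{\beta_1}{}_{\nu_1} \cdots (\Lambda^{-1})^{\beta_m}{}_{\nu_m} \, T^{\alpha_1 \cdots \alpha_n}{}_{\beta_1 \cdots \beta_m} . \tag{B.16}$$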

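Often, we wish to emphasize certain symmetries possessed by tensors. This is done via parentheses
and brackets in the following way:$^{9}$

$$\begin{aligned}
T^{(\alpha\beta)\mu_1 \cdots \mu_n} &= \frac{1}{2}\left( T^{\alpha\beta\mu_1 \cdots \mu_n} + T^{\beta\alpha\mu_1 \cdots \mu_n} \right) \\
T^{[\alpha\beta]\mu_1 \cdots \mu_n} &= \frac{1}{2}\left( T^{\alpha\beta\mu_1 \cdots \mu_n} - T^{\beta\alpha\mu_1 \cdots \mu_n} \right) .
\end{aligned} \tag{B.17}$$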

Invariance under coordinate transformations plays an important role in both General Relativity
and the Standard Model. In General Relativity, this invariance plays a central role and
severely constrains the form of the theory. In the SM, we are concerned mainly with flat space so
the constraints imposed are not as harsh, but the theory is still constrained in much the same
way as rotational invariance constrains classical mechanics. For example, suppose we wish to
calculate a vector quantity which depends only on a certain momentum, $p$. A priori, we must
calculate 4 different quantities. However, the requirement of Lorentz covariance forces this
object to transform like a vector under Lorentz transformations. Since $p$ is the only vector in
the problem, our quantity must be proportional to $p$. We need only calculate the constant of
proportionality (which will in general depend on all invariants in the problem, i.e. masses and
$p^2$). With two indices, the problem is a little more complicated. Suppose we have a two-index
tensor which depends on no momenta. The argument above would lead to 0 for an answer
since there is 'nothing to carry the indices'. What this means is that there is no reference
to frame in this problem. Our quantity must be frame-independent. However, this does not
imply that it is zero since the tensor $g^{\mu\nu}$ satisfies our requirement, as exemplified by (B.13).
In this case, Lorentz invariance forces our tensor to be proportional to $g^{\mu\nu}$.

8 When indices are suppressed in this kind of calculation, confusion as to the specific way they are contracted
can arise. The standard convention is that a proper sum goes from covariant to contravariant. For example,
$(\Lambda x)^\mu = \Lambda^\mu{}_\nu x^\nu$, but $\Lambda_\mu{}^\nu x_\nu = (\Lambda^{-1})^\nu{}_\mu x_\nu = (x \Lambda^{-1})_\mu$. Note that in the second example the placement of the
indices is crucial to a correct understanding of the product. The first expression is not a proper sum since it
goes from contravariant to covariant. In order to write it in an unambiguous way, we had to emphasize the
order of the indices.

9 Sometimes, as we will see below, parentheses also imply tracelessness. It should be clear from the context
which is meant.

There is only one other tensor-like object which is invariant under Lorentz transformations.
The Levi-Civita$^{10}$ tensor is defined as a 4-index tensor which is completely antisymmetric in
all four indices. Since there are only four possible values for each of the indices, the Levi-Civita
tensor has only one independent component. This component is chosen to be 1, i.e.

$$\epsilon^{0123} = +1 . \tag{B.18}$$

All other components are obtained from the requirement of total antisymmetry. To see that
the Levi-Civita tensor is invariant, we transform it:

$$\epsilon^{\mu_1 \mu_2 \mu_3 \mu_4} \to \Lambda^{\mu_1}{}_{\nu_1} \Lambda^{\mu_2}{}_{\nu_2} \Lambda^{\mu_3}{}_{\nu_3} \Lambda^{\mu_4}{}_{\nu_4} \, \epsilon^{\nu_1 \nu_2 \nu_3 \nu_4} = \det(\Lambda) \, \epsilon^{\mu_1 \mu_2 \mu_3 \mu_4} , \tag{B.19}$$

where the last equality comes from the definition of the determinant,

$$\det(\Lambda) = \epsilon^{\nu_1 \nu_2 \nu_3 \nu_4} \Lambda^{0}{}_{\nu_1} \Lambda^{1}{}_{\nu_2} \Lambda^{2}{}_{\nu_3} \Lambda^{3}{}_{\nu_4} . \tag{B.20}$$



(B.13) implies that the determinant of $\Lambda$ is either $+1$ or $-1$. The structure of the Lorentz
group is such that all transformations with determinant $-1$ can be obtained from determinant
$+1$ transformations and the two discrete operations of parity and time-reversal. We see from
(B.19) that these two operations change the sign of the Levi-Civita symbol. This fact causes
$\epsilon$ to play an important role in the spin structure of hadrons in QCD.



10 Civita is pronounced 'chaveta', while Levi is like 'Drove my Chevy to the levee'.

Up to now, our analysis has been quite general. None of the above manipulations have
required the use of the Minkowski metric (B.10). At this point, we specialize to flat space
and discuss the Lorentz transformations explicitly. (B.13) implies that $\Lambda$ is orthogonal, i.e.
$\Lambda^T \Lambda = 1$. In view of the above discussion on the operations of parity and time-reversal, we will
consider only those transformations with $\det \Lambda = +1$. The group of orthogonal transformations
in Minkowski space with unit determinant is called SO(3, 1) (the special orthogonal group in
3 spacelike and 1 timelike dimensions). Any real orthogonal matrix can be written as $e^M$,
with $M = -M^T$ a real skew-symmetric matrix. A general skew-symmetric real $4 \times 4$ matrix
has six independent components, and hence the space of these objects can be spanned by
six linearly independent matrices. The matrices we choose to form this basis are called the
generators of SO(3, 1) since they generate the group. The algebra satisfied by the generators is
called the Lie$^{11}$ algebra of the group. Any member of the group is uniquely determined by the
six coefficients of the generators. The fact that there is a one-to-one mapping between these
coefficients and the elements of the group means that the coefficients form a representation of
the group, the adjoint representation.

The six generators of SO(3, 1) can be conveniently placed into an antisymmetric 4-tensor,
i.e.$^{12}$

$$M^\alpha{}_\beta = \frac{1}{2} \theta_{\mu\nu} (J^{\mu\nu})^\alpha{}_\beta . \tag{B.21}$$

In principle, we are now free to choose whatever form we like for the generators $J^{\mu\nu}$, as long
as we end up with six linearly independent skew-symmetric real matrices. The simplest
choice is also the most natural:

$$(J^{\mu\nu})^\alpha{}_\beta = g^{\mu\alpha} \delta^\nu_\beta - g^{\nu\alpha} \delta^\mu_\beta . \tag{B.22}$$

Working out the commutator of the generators,

$$[J^{\mu\nu}, J^{\alpha\beta}] = g^{\mu\beta} J^{\nu\alpha} - g^{\nu\beta} J^{\mu\alpha} - g^{\mu\alpha} J^{\nu\beta} + g^{\nu\alpha} J^{\mu\beta} , \tag{B.23}$$

we find that SO(3, 1) is a non-abelian group. This means that we must be careful with the
order of transformations. In particular, $\theta$'s for successive transformations do not add.
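The algebra (B.23) can be checked by brute force for the explicit matrices (B.22). The Python sketch below does this with integer arithmetic, so the comparison is exact; the function names (`generator`, `rhs_b23`, `algebra_closes`) are our own illustrative choices, and the metric signature $(+,-,-,-)$ is assumed as in the rest of this appendix.

```python
# Minkowski metric with signature (+,-,-,-); g^{mu nu} has the same entries.
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def generator(mu, nu):
    """(J^{mu nu})^a_b = g^{mu a} delta^nu_b - g^{nu a} delta^mu_b, Eq. (B.22)."""
    return [[G[mu][a] * (nu == b) - G[nu][a] * (mu == b) for b in range(4)]
            for a in range(4)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return [[AB[i][j] - BA[i][j] for j in range(4)] for i in range(4)]

def rhs_b23(mu, nu, al, be):
    """Right-hand side of Eq. (B.23) as a 4x4 matrix."""
    terms = [(G[mu][be], generator(nu, al)), (-G[nu][be], generator(mu, al)),
             (-G[mu][al], generator(nu, be)), (G[nu][al], generator(mu, be))]
    return [[sum(c * M[i][j] for c, M in terms) for j in range(4)]
            for i in range(4)]

def algebra_closes():
    """True if (B.23) holds for every choice of the four indices."""
    idx = range(4)
    return all(commutator(generator(m, n), generator(a, b)) == rhs_b23(m, n, a, b)
               for m in idx for n in idx for a in idx for b in idx)

print(algebra_closes())
# Rotations about different axes do not commute: [J^{12}, J^{23}] is nonzero.
print(commutator(generator(1, 2), generator(2, 3)))
```

The second printout makes the non-abelian nature of the group concrete: the commutator of two rotation generators is itself a (nonzero) rotation generator.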



11 As in 'the lie of the stone.'

12 The factor of 1/2 corrects the overcounting of the implied sum on $\mu\nu$.

Since three-dimensional rotations are among the possible Lorentz transformations, our
generators should include those for ordinary SO(3) rotations. This is indeed the case, as we
can see by examining any generator with both $\mu$ and $\nu$ spacelike. For example,

$$(J^{12})^\alpha{}_\beta = \begin{pmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} . \tag{B.24}$$

It is easy to see that this matrix has four different powers, only two of which are linearly
independent. Hence the SO(3, 1) rotation it generates is

$$e^{\theta J^{12}} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos\theta & -\sin\theta & 0 \\ 0 & \sin\theta & \cos\theta & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} , \tag{B.25}$$

which is a rotation of $\theta$ radians about the 3-axis. Hence we identify $J^{12} = L^3$. For general
rotations, the identification$^{13}$ $\epsilon^{ijk} J^{ij} = 2 L^k$ is appropriate. (B.23) then implies



$$[L^i, L^j] = \epsilon^{ijk} L^k \tag{B.26}$$

as expected for generators of SO(3) rotations. Note that pure rotations form a subgroup of
SO(3, 1), that is $SO(3) \subset SO(3, 1)$.

The other three generators have one time index and one spatial index. These objects will
generate Lorentz boosts, as can be seen by examining $J^{01}$:

$$(J^{01})^\alpha{}_\beta = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} . \tag{B.27}$$



13 It is a standard convention to use Greek letters for general indices and Roman letters when one wants to
specify a spatial index. The $\epsilon$ tensor here is the Levi-Civita tensor for SO(3), with the convention $\epsilon^{123} = +1$.
Since the metric relevant to SO(3) is trivial, we make no distinction between contravariant and covariant
indices. However, a few well-placed minus signs are required when going from a general expression to one
which specifies spatial indices since $g_{ij} = -\delta_{ij}$.

Once again, the powers of this matrix are cyclic so exponentiation is trivial:

$$e^{\theta J^{01}} = \begin{pmatrix} \cosh\theta & \sinh\theta & 0 & 0 \\ \sinh\theta & \cosh\theta & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} . \tag{B.28}$$

Under the action of this transformation,$^{14}$

$$\begin{aligned}
t &\to t \cosh\theta + x \sinh\theta \\
x &\to x \cosh\theta + t \sinh\theta \\
y &\to y .
\end{aligned} \tag{B.29}$$



It is obvious from the infinitesimal form of this transformation that we have given the vector
$x$ a velocity $\theta$ along the 1-direction. For asymptotically large values of $\theta$, $x$ and $t$ approach the
same value $(t + x) \cosh\theta$, making the vector $x$ approximately lightlike.$^{15}$ Hence asymptotically
large $\theta$ boost the vector $x$ approximately to the speed of light. This argument promotes the
identification $v = \tanh\theta$. We can now eliminate $\theta$ in favor of $v$ to obtain

$$\begin{aligned}
t &\to \gamma (t + v x) \\
x &\to \gamma (x + v t) \\
y &\to y ,
\end{aligned} \tag{B.30}$$

the new coordinates of a vector $x$ which has been endowed with a speed $v$ along the 1-direction.
Again, we have used $\gamma = (1 - v^2)^{-1/2}$. This transformation reduces to the familiar Galilean
transformation if we reinstate $c$ and take $v \ll c$ (unless we take $xv/c^2$ comparable to $t$).
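The equivalence of the rapidity form (B.29) and the velocity form (B.30) is easy to confirm numerically. The sketch below, with sample values chosen purely for illustration, also shows that for two boosts along one fixed axis the parameters $\theta$ do add, while the corresponding velocities do not; the failure of $\theta$'s to add in general comes from combining transformations about different axes.

```python
import math

def boost_rapidity(t, x, theta):
    """Eq. (B.29): boost along the 1-direction with rapidity theta (c = 1)."""
    return (t * math.cosh(theta) + x * math.sinh(theta),
            x * math.cosh(theta) + t * math.sinh(theta))

def boost_velocity(t, x, v):
    """Eq. (B.30): the same boost written with v and gamma = (1 - v^2)^(-1/2)."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return gamma * (t + v * x), gamma * (x + v * t)

t, x = 2.0, 0.5            # sample event (assumed values)
theta1, theta2 = 0.7, 1.1  # sample rapidities (assumed values)

# (B.29) and (B.30) agree once v = tanh(theta).
via_theta = boost_rapidity(t, x, theta1)
via_v = boost_velocity(t, x, math.tanh(theta1))

# Two boosts along the same axis: the rapidities add ...
twice = boost_rapidity(*boost_rapidity(t, x, theta1), theta2)
once = boost_rapidity(t, x, theta1 + theta2)

# ... but the velocities do not; the combined speed stays below 1.
v_combined = math.tanh(theta1 + theta2)
v_naive_sum = math.tanh(theta1) + math.tanh(theta2)

print(via_theta, via_v)
print(twice, once)
print(v_combined, v_naive_sum)
```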
The transformations $J^{0i} = K^i$ do not form a subgroup of SO(3, 1) since (B.23) implies

$$[K^i, K^j] = -\epsilon^{ijk} L^k . \tag{B.31}$$

14 We will use the notation $x^0 = t$, $x^1 = x$, $x^2 = y$, and $x^3 = z$ interchangeably. Confusion between $x^1 = x$
and $x^\mu = x$ should be rare.

15 Of course, this vector does not actually become lightlike since its invariant square is invariant. However,
in our new frame $x^2$ will in fact be much smaller than the square of its $t$ and $x$ coordinates. This makes the
vector look lightlike in this frame.

The commutator of $L$ with $K$ is

$$[L^i, K^j] = \epsilon^{ijk} K^k , \tag{B.32}$$

which closes the algebra of the group.



B.2 Unitary Representations of SO(3, 1)



In this section, we will study how the Lorentz group acts on fields. The first thing we must
do is expand our group to include translations. Under a translation, the vector $x$ is shifted
to $x + a$. This transformation is not linear since $(x + y) + a \neq (x + a) + (y + a)$. However, a
function of the vector $x$ will transform linearly under a translation:

$$T(a) f(x) = f(x - a) . \tag{B.33}$$

Note that we have implicitly defined the transformation $T(a)$ in such a way that it shifts the
function it acts on forward an amount $a$ rather than the axes. Hence the value of the shifted
function at $x$ is the same as the value of the unshifted function at $x - a$. This extension
of the Lorentz group is called the Poincaré$^{16}$ group, and its representations form the basis for
quantum field theory.

Since we do not want to alter the normalizations of our states through these transformations,
we will be concerned with unitary representations of the Poincaré group. Elements
of these representations will be related to the coordinate transformations $\Lambda$ and $a$,
i.e. $U = U(\Lambda, a)$. The group multiplication rule,

$$U(\Lambda_2, a_2) U(\Lambda_1, a_1) = U(\Lambda_2 \Lambda_1, a_2 + \Lambda_2 a_1) , \tag{B.34}$$

is governed by these transformations as well. This rule implies $U^{-1}(\Lambda, a) = U^\dagger(\Lambda, a) =
U(\Lambda^{-1}, -\Lambda^{-1} a)$. $U$ is unitary, so it can be represented as the exponential of an anti-hermitian
matrix. The translations add four generators to the Lorentz group, one for each dimension of
spacetime, so we have the form

$$U(1 + \tfrac{1}{2} \theta_{\alpha\beta} J^{\alpha\beta}, \epsilon) = 1 - \tfrac{i}{2} \theta_{\alpha\beta} \mathcal{J}^{\alpha\beta} + i \epsilon_\mu P^\mu \tag{B.35}$$



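16 like s'il vous plaît

for arbitrary infinitesimal $\epsilon$ and $\theta$. Note that $J$ are the matrices from the last section; they
act on physical spacetime vectors. $\mathcal{J}$ and $P$ are hermitian operators that act on the space in
which the fields we are concerned with live. Consider the transformation

$$U(\Lambda, a) \, U(1 + \tfrac{1}{2} \theta J, \epsilon) \, U^{-1}(\Lambda, a) = U(1 + \tfrac{1}{2} \Lambda \theta J \Lambda^{-1}, \Lambda \epsilon - \tfrac{1}{2} \Lambda \theta J \Lambda^{-1} a) . \tag{B.36}$$

Expanding in $\theta$ and $\epsilon$, substituting the explicit expression for $J$, and noting that each component
of $\theta$ and $\epsilon$ may be varied independently, we arrive at

$$U(\Lambda, a) \, \mathcal{J}^{\mu\nu} \, U^{-1}(\Lambda, a) = (\Lambda^{-1})^\mu{}_\alpha (\Lambda^{-1})^\nu{}_\beta \left( \mathcal{J}^{\alpha\beta} - a^\alpha P^\beta + a^\beta P^\alpha \right) \tag{B.37}$$

$$U(\Lambda, a) \, P^\mu \, U^{-1}(\Lambda, a) = (\Lambda^{-1})^\mu{}_\nu P^\nu . \tag{B.38}$$

Equation (B.37) states that for pure rotations ($a = 0$), transforming the operator $\mathcal{J}$ is the same
as performing the inverse transformation on the coordinate indices of $\mathcal{J}$. The inverse transformation
appears here for the same reason that $x - a$ appears as the argument of $T(a) f(x)$.
When translations are included, we see that $\mathcal{J}$ transforms in exactly the way we expect an
angular momentum to transform under a change of origin. (B.38) tells us that the operator
$P$ transforms as a vector under Lorentz transformations, but is invariant under translation.

We can obtain the Lie algebra of this group by considering (B.37) and (B.38) for infinitesimal
transformations $\Lambda$ and $a$. Performing the same manipulations as above, we see that

$$-i [\mathcal{J}^{\mu\nu}, \mathcal{J}^{\alpha\beta}] = g^{\mu\beta} \mathcal{J}^{\nu\alpha} - g^{\nu\beta} \mathcal{J}^{\mu\alpha} - g^{\mu\alpha} \mathcal{J}^{\nu\beta} + g^{\nu\alpha} \mathcal{J}^{\mu\beta} \tag{B.39}$$

$$-i [P^\mu, \mathcal{J}^{\alpha\beta}] = g^{\mu\alpha} P^\beta - g^{\mu\beta} P^\alpha \tag{B.40}$$

$$-i [P^\mu, P^\nu] = 0 . \tag{B.41}$$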



The last identity implies that the subgroup of pure translations is an abelian group. The 
algebra of the Lorentz subgroup is the same as we found above, as it should be. However, 
it must be emphasized that there we were discussing 4x4 matrices rather than the infinite 
dimensional quantum mechanical operators here. 
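The multiplication rule (B.34) and the inverse $U^{-1}(\Lambda, a) = U(\Lambda^{-1}, -\Lambda^{-1} a)$ simply mirror how the pairs $(\Lambda, a)$ compose when acting on coordinates as $x \to \Lambda x + a$. The following Python sketch checks this in a $1+1$ dimensional toy setting, where $\Lambda$ is a single boost acting on $(t, x)$; the restriction to two dimensions, the helper names and the sample numbers are all assumptions made to keep the example short.

```python
import math

def boost(theta):
    """2x2 Lorentz boost acting on (t, x); a 1+1 dimensional stand-in for Lambda."""
    ch, sh = math.cosh(theta), math.sinh(theta)
    return [[ch, sh], [sh, ch]]

def mat_vec(L, v):
    return [L[0][0] * v[0] + L[0][1] * v[1], L[1][0] * v[0] + L[1][1] * v[1]]

def mat_mat(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def act(element, x):
    """Coordinate action x -> Lambda x + a of the pair (Lambda, a)."""
    L, a = element
    Lx = mat_vec(L, x)
    return [Lx[0] + a[0], Lx[1] + a[1]]

def compose(e2, e1):
    """(Lambda2, a2)(Lambda1, a1) = (Lambda2 Lambda1, a2 + Lambda2 a1), as in (B.34)."""
    (L2, a2), (L1, a1) = e2, e1
    La = mat_vec(L2, a1)
    return mat_mat(L2, L1), [a2[0] + La[0], a2[1] + La[1]]

def inverse(theta, a):
    """(Lambda, a)^{-1} = (Lambda^{-1}, -Lambda^{-1} a); here Lambda^{-1} = boost(-theta)."""
    Linv = boost(-theta)
    La = mat_vec(Linv, a)
    return Linv, [-La[0], -La[1]]

e1 = (boost(0.4), [1.0, -2.0])   # sample group elements (assumed values)
e2 = (boost(-0.9), [0.5, 3.0])
x = [2.0, 0.7]

print(act(compose(e2, e1), x), act(e2, act(e1, x)))
print(act(inverse(0.4, [1.0, -2.0]), act(e1, x)), x)
```

Each printed pair should coincide up to rounding: acting with the composed element equals acting twice, and the quoted inverse undoes the original transformation.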

$P$ and $\mathcal{J}$ have a very concrete physical meaning in quantum mechanics. To see this,
consider a system which is translationally invariant. Since the phase of an isolated$^{17}$ quantum
mechanical system is not observable, the state may change at most by the overall phase

$$T(a) |S\rangle = e^{i\delta(a)} |S\rangle , \tag{B.42}$$

where $\delta(a)$ is some real function of $a$. In light of the above discussion, we see that $\delta(a)$ should
be identified with $P \cdot a$. This implies that $|S\rangle$ is an eigenstate of $P$. According to (B.41),
this eigenvalue is conserved under translation. In classical mechanics, the conserved quantity
related to translation invariance is the momentum.$^{18}$ Hence, we identify the conserved quantity
here with the momentum of the system.$^{19}$ Since the momentum of the system is an eigenvalue
of $P$, we call $P$ the momentum operator.

17 If it wasn't isolated, it could not be translationally invariant. If state vectors interact, we must translate
them all simultaneously. This is exactly the same as in classical mechanics, where we must consider the total
momentum of a system to obtain conservation.

Attempting to apply the same analysis to the operator $\mathcal{J}$, we run into a little snag. The
fact that the $\mathcal{J}$'s do not commute with each other implies that $|S\rangle$ cannot simultaneously be
an eigenstate of all of them unless most of its eigenvalues are zero.$^{20}$ When discussing $\mathcal{J}$, it
is convenient to define $L^i = \frac{1}{2} \epsilon^{ijk} \mathcal{J}^{jk}$ and $K^i = \mathcal{J}^{0i}$ in analogy with the last section. The
commutation relations (B.39) imply

$$[L^i, L^j] = i \epsilon^{ijk} L^k , \tag{B.43}$$

$$[K^i, K^j] = -i \epsilon^{ijk} L^k , \tag{B.44}$$

$$[L^i, K^j] = i \epsilon^{ijk} K^k . \tag{B.45}$$

These relations have the interesting consequence that the operators

$$\begin{aligned}
A^i &= \frac{1}{2} \left( L^i + i K^i \right) \\
B^i &= \frac{1}{2} \left( L^i - i K^i \right)
\end{aligned} \tag{B.46}$$

generate commuting subgroups of SO(3, 1),

$$\begin{aligned}
[A^i, A^j] &= i \epsilon^{ijk} A^k \\
[B^i, B^j] &= i \epsilon^{ijk} B^k \\
[A^i, B^j] &= 0 .
\end{aligned} \tag{B.47}$$

Furthermore, the commutation relations satisfied by these generators identify the subgroups
with SU(2). Hence the group of Lorentz transformations, SO(3, 1), is seen to be nothing more
than the direct product of two commuting SU(2)'s. This will be seen even more explicitly
when we study the fundamental representation of SL(2, C) and its relation to SO(3, 1) in the
next section.

18 Remember, momentum is conserved in the absence of external forces, i.e. forces which do not belong to
the system we consider. These forces would break translational invariance, and thus momentum conservation.

19 This really implies that the momentum of the system is proportional to the eigenvalue. If the momentum
of the system is $p$ and the eigenvalue is $k$, we say $p = \hbar k$. $p$ has the dimensions of momentum
(Energy/Speed = Energy since we work in units where speed is dimensionless) and $k$ has the dimensions of an
inverse time (since the exponential of $P \cdot a$ makes sense; remember that since speed is dimensionless length and
time are the same unit), so $\hbar$ is Energy $\times$ Time. Once again, we can adjust our units so that $\hbar = 1$. In other
units, we can actually measure its value; in SI units, $\hbar = 1.05457266(63) \times 10^{-34}$ J s.

20 We can see this as follows. Suppose $\mathcal{J}^{\mu\nu} |S\rangle = j^{\mu\nu} |S\rangle$. Now, $[\mathcal{J}^{\mu\nu}, \mathcal{J}^{\lambda\rho}] |S\rangle = 0$ since the numbers $j^{\mu\nu}$ and
$j^{\lambda\rho}$ commute. This implies that the commutator has eigenvalue zero.
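The decoupling in (B.47) can be verified on a concrete realisation of the algebra. In the sketch below we assume, purely for illustration, the finite-dimensional choice $\mathcal{J}^{\mu\nu} = i J^{\mu\nu}$ with $J^{\mu\nu}$ the $4 \times 4$ matrices of (B.22); this satisfies (B.43)-(B.45) but is not a unitary representation, so it only tests the algebra and not the physics of the state space. All function names are our own.

```python
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
ZERO = [[0] * 4 for _ in range(4)]

def generator(mu, nu):
    """Matrix generator (J^{mu nu})^a_b of Eq. (B.22)."""
    return [[G[mu][a] * (nu == b) - G[nu][a] * (mu == b) for b in range(4)]
            for a in range(4)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def comm(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return [[AB[i][j] - BA[i][j] for j in range(4)] for i in range(4)]

def lin(c1, A, c2, B):
    """Linear combination c1*A + c2*B (returns a new matrix)."""
    return [[c1 * A[i][j] + c2 * B[i][j] for j in range(4)] for i in range(4)]

def eps(i, j, k):
    """Levi-Civita symbol for indices in {1, 2, 3}, with eps(1, 2, 3) = +1."""
    return (i - j) * (j - k) * (k - i) // 2

# Assumed operator realisation: script-J = i * (matrix J).
L = {1: lin(1j, generator(2, 3), 0, ZERO),
     2: lin(1j, generator(3, 1), 0, ZERO),
     3: lin(1j, generator(1, 2), 0, ZERO)}
K = {i: lin(1j, generator(0, i), 0, ZERO) for i in (1, 2, 3)}
A = {i: lin(0.5, L[i], 0.5j, K[i]) for i in (1, 2, 3)}   # Eq. (B.46)
B = {i: lin(0.5, L[i], -0.5j, K[i]) for i in (1, 2, 3)}

def structure(X, i, j):
    """i * eps^{ijk} X^k, summed over k."""
    out = ZERO
    for k in (1, 2, 3):
        out = lin(1, out, 1j * eps(i, j, k), X[k])
    return out

def close(P, Q, tol=1e-12):
    return all(abs(P[i][j] - Q[i][j]) < tol for i in range(4) for j in range(4))

def two_commuting_su2():
    """True if A and B each obey the SU(2) algebra and commute with each other."""
    r = (1, 2, 3)
    return (all(close(comm(A[i], A[j]), structure(A, i, j)) for i in r for j in r)
            and all(close(comm(B[i], B[j]), structure(B, i, j)) for i in r for j in r)
            and all(close(comm(A[i], B[j]), ZERO) for i in r for j in r))

print(two_commuting_su2())
```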

In non-relativistic physics, we concern ourselves with time evolution and consider a quantity
to be conserved if it does not change with time. The Schrödinger approach to quantum
mechanics considers state vectors that change with time,

$$|S(t)\rangle = e^{-itH} |S(0)\rangle . \tag{B.48}$$

The time dependence is governed by the time-translation operator, or Hamiltonian, $H = P^0$.
The action of an operator on a state will change with time because the state itself changes with
time. In any physical matrix element, we can use this relation to display the time dependence
explicitly:

$$\langle S'(t) | O | S(t) \rangle = \langle S'(0) | e^{itH} O e^{-itH} | S(0) \rangle . \tag{B.49}$$

This promotes the identification $O(t) = e^{itH} O e^{-itH}$, which puts the time-dependence of the
matrix element into the operator. This is Heisenberg's view of quantum mechanics. Here, the
action of an operator on a state will change with time because the operator changes with time.
If the operator we are concerned with commutes with the Hamiltonian, it will not depend on
time. Hence all of its matrix elements will be conserved. Looking at Eqs.(B.40) and (B.41), we
see that $P^i$ and $L^i$ are conserved operators.$^{21}$ Since $L^i$ is a conserved operator associated with
spatial rotation, we identify it with the angular momentum operator. If a system is invariant
under rotation about a certain axis, say $\hat{n}$, the argument given above for $P$ implies that it is
an eigenstate of $\hat{n} \cdot L$. On the other hand, the commutator

$$[H, K^i] = i P^i \tag{B.50}$$

implies that $K^i$ will depend on time.

21 In non-relativistic quantum mechanics, one usually has some sort of space-dependent potential that
completely destroys translation invariance. These potentials act as static momentum sources which effectively
generate nonzero commutators of the Hamiltonian with momentum and angular momentum operators.

In relativistic quantum mechanics, we are concerned mostly with operators; the external
states can all be formed from the vacuum by the action of various operators. For this reason,
it is Heisenberg's approach that is most useful here. The fact that time and space mix under
Poincaré transformations makes it natural for us to consider them together. In a translationally
invariant theory, the space dependence of the external states can be removed in exactly the
same way as the time dependence. For any spatial 4-vector $x^\mu$ and any quantum operator $O$,
we have the operator identity

$$O(x) = e^{i x \cdot P} O(0) e^{-i x \cdot P} . \tag{B.51}$$

Operators that commute with all of the $P^\mu$ do not depend on coordinates at all.$^{22}$

We now turn to a classification of the states in our theory. Suppose we have a state which
represents an electron with momentum $p^\mu$ and spin polarization $s^\mu$. Poincaré transformations
can change the electron's momentum and spin, but certainly cannot change the fact that it is
an electron. This means that general properties attributed to electrons must be associated with
operators that commute with all of the generators of the group. Operators with this property
are called Casimirs of the group. It is easily shown that the mass operator$^{23}$ $P \cdot P = P^2$ is
a Casimir of the Poincaré group. The other Casimir has to do with spin. We know that an
electron has spin-1/2 regardless of where it is or how we rotate it. There must be an operator
that expresses that. By inspection, one can see that the Pauli-Ljubanskii vector$^{24}$

$$W^\mu = -\frac{1}{2} \epsilon^{\mu\nu\alpha\beta} P_\nu \mathcal{J}_{\alpha\beta} \tag{B.52}$$

is exactly what is needed. Under parity, $W^\mu$ behaves exactly as a spin should since $\epsilon$ changes
sign. $W$ is also orthogonal to $P$ and reduces to a spatial angular momentum operator in
the rest frame, $P = (M, 0, 0, 0)$. The commutation relations of this vector with the generators
of the Poincaré group are identical with those of $P^\mu$:

$$\begin{aligned}
-i [W^\mu, P^\nu] &= 0 \\
-i [W^\mu, \mathcal{J}^{\alpha\beta}] &= g^{\mu\alpha} W^\beta - g^{\mu\beta} W^\alpha .
\end{aligned} \tag{B.53}$$

This makes it obvious that its square

$$W^2 = -\frac{1}{2} \mathcal{J}_{\mu\nu} \mathcal{J}^{\mu\nu} P^2 + \mathcal{J}_{\lambda\rho} \mathcal{J}^{\sigma\rho} P^\lambda P_\sigma \tag{B.54}$$

is a Casimir.$^{25}$

22 This is what we mean by 'translationally invariant'!

23 Remember that in relativity the invariant square of the momentum of a system is the square of the mass
of that system.

24 Under the influence of the Levi-Civita symbol, the operators $P$ and $\mathcal{J}$ commute.

To classify the irreducible representations of the Poincaré group, we appeal to the idea of
Wigner's 'little group' [89]. To begin with, we diagonalize $P^\mu$:

$$P^\mu |p\rangle = p^\mu |p\rangle . \tag{B.55}$$

Since $p^\mu$ is a vector, it will transform under Lorentz transformations as

$$p^\mu \to \Lambda^\mu{}_\nu p^\nu . \tag{B.56}$$

We can use this to transform our state to a convenient frame with a representative momentum
$p^\mu$. In this frame, there will be residual transformations which do not change $p^\mu$, but may act
on our state. This group is called the little group associated with the reference momentum
$p^\mu$. The representations of the little group will tell us how many different states of momentum
$p^\mu$ are related to our state through the Poincaré group. Since $W$ commutes with $P$, we can
diagonalize one of its components to break the degeneracy associated with this subgroup. We
will consider two examples of reference momentum $p^\mu$ since these are the only two which
actually occur in nature (to our knowledge).

As our first example, we take states for which $p^0 > 0$ and $p^2 = M^2 > 0$. A look at the
Lorentz transformations tells us that if $p^0 > 0$ in one frame, it will be in all frames, provided
$p^2 > 0$.$^{26}$ Furthermore, there is always a rest frame, defined by $p^\mu = (M, 0, 0, 0)$. The little
group associated with this momentum is the group of spatial rotations, SO(3). In this frame,
we have $W^0 = 0$ and $W^i = M L_i$. Since $L^2 = L_i L_i$ commutes with $L_j$, we can simultaneously
diagonalize $L^2$ and one of the $L_j$:

$$\begin{aligned}
L^2 |\ell m\rangle &= \ell(\ell + 1) |\ell m\rangle \\
L_3 |\ell m\rangle &= m |\ell m\rangle .
\end{aligned} \tag{B.57}$$

Since $L^2$ is positive definite, we can take $\ell \geq 0$. The reason for the special form of its eigenvalue
will become clear shortly. The hermiticity of $L^2$ and $L_3$ allows us to choose orthonormal
eigenstates. Defining

$$L_\pm = L_1 \pm i L_2 , \tag{B.58}$$

we have

$$[L_3, L_\pm] = \pm L_\pm . \tag{B.59}$$

Hence

$$L_3 L_\pm |\ell m\rangle = (m \pm 1) L_\pm |\ell m\rangle , \tag{B.60}$$

making $L_\pm$ the raising and lowering operators associated with $L_3$. If this process were to go
on indefinitely, we would at some point have a state for which $m > \ell$. However,

$$\ell(\ell + 1) = \langle \ell m | L^2 | \ell m \rangle = \langle \ell m | L_1^2 + L_2^2 | \ell m \rangle + m^2 \tag{B.61}$$

implies that $m^2 \leq \ell(\ell + 1)$. This means that there must be some state $|\ell m_\pm\rangle$ that truncates
the process,

$$L_\pm |\ell m_\pm\rangle = 0 . \tag{B.62}$$

Writing $L^2 = L_\pm L_\mp + L_3^2 \mp L_3$, it is easy to show that $m_\pm = \pm\ell$. Starting with the 'top'
state, $|\ell \ell\rangle$, we can apply $L_-$ to successively lower $m$ by one unit each time. At some point,
we must reach the 'bottom' state $|\ell (-\ell)\rangle$ else suffer the consequences of inconsistency. Since
we cannot apply $L_-$ a fraction of a time, this implies that $\ell - n = -\ell$ for some whole number
$n$. Hence $\ell$ must be either integer or half-integer.$^{27}$ For each $\ell$, we can define all of the states
in our representation from the top state via successive application of $L_-$. The appropriate
normalization is

$$L_\pm |\ell m\rangle = \sqrt{(\ell \mp m)(\ell \pm m + 1)} \, |\ell \, m \pm 1\rangle , \tag{B.63}$$

as can be obtained from the requirement that $\ell(\ell + 1) = \langle \ell m | L^2 | \ell m \rangle$.$^{28}$

In the rest frame of a massive system, we have shown that the spectrum of $W^2$ is $(-M^2 \ell(\ell + 1))$,
where $\ell \geq 0$ is either integer or half-integer. Since this object is a Casimir of the Poincaré
group, different values of $\ell$ correspond to different irreducible representations. For fixed $\ell$,
there are $2\ell + 1$ different states in the representation. Under the action of the rotation group,
these states behave as though they have different angular momentum, so we say that in the rest
frame the state has angular momentum (spin) $\ell$. The spin vector is defined by $s^\mu = W^\mu / M$.
States in other frames can be obtained by boosting the rest frame states.

25 This is obvious once one knows that $W$ is a vector that commutes with $P$. Any Lorentz scalar will
commute with the rotation generators. It is the fact that $W$ is translation invariant that makes it special.

26 We consider here only the proper Lorentz group. Parity and time-reversal can be taken into account
separately later.

27 This is the reason we assumed the form $\ell(\ell + 1)$ for the eigenvalue of $L^2$.
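The construction above translates directly into explicit matrices. As a minimal sketch, the following Python code builds $L_3$ and $L_\pm$ for a given $\ell$ from (B.57) and the normalization (B.63), then checks (B.59) and the eigenvalue $\ell(\ell + 1)$ using the identity $L^2 = L_+ L_- + L_3^2 - L_3$ quoted in the text. The basis ordering ($m = \ell, \ell - 1, \ldots, -\ell$) and the function names are assumptions of this example.

```python
import math

def spin_matrices(two_l):
    """Return (L3, L_plus, L_minus) for spin l = two_l / 2 in the basis m = l, ..., -l."""
    l = two_l / 2
    ms = [l - k for k in range(two_l + 1)]
    n = len(ms)
    L3 = [[ms[i] if i == j else 0.0 for j in range(n)] for i in range(n)]
    Lp = [[0.0] * n for _ in range(n)]
    for j in range(1, n):
        m = ms[j]
        # (B.63): L_+ |l m> = sqrt((l - m)(l + m + 1)) |l m+1>; m+1 sits at index j-1.
        Lp[j - 1][j] = math.sqrt((l - m) * (l + m + 1))
    Lm = [[Lp[j][i] for j in range(n)] for i in range(n)]  # adjoint of a real matrix
    return L3, Lp, Lm

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def casimir(two_l):
    """L^2 = L_+ L_- + L_3^2 - L_3 (upper signs of the identity in the text)."""
    L3, Lp, Lm = spin_matrices(two_l)
    n = len(L3)
    PM, L3sq = matmul(Lp, Lm), matmul(L3, L3)
    return [[PM[i][j] + L3sq[i][j] - L3[i][j] for j in range(n)] for i in range(n)]

def raising_relation_holds(two_l, tol=1e-12):
    """Check [L_3, L_+] = +L_+, Eq. (B.59)."""
    L3, Lp, _ = spin_matrices(two_l)
    n = len(L3)
    A, B = matmul(L3, Lp), matmul(Lp, L3)
    return all(abs(A[i][j] - B[i][j] - Lp[i][j]) < tol
               for i in range(n) for j in range(n))

for two_l in (1, 2, 3):
    l = two_l / 2
    diag = [round(casimir(two_l)[i][i], 12) for i in range(two_l + 1)]
    print(two_l, diag, l * (l + 1), raising_relation_holds(two_l))
```

For each $\ell$ the matrix has dimension $2\ell + 1$, matching the count of states in the representation.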

The other kind of states which appear in nature have $p^2 = 0$, $p^0 > 0$. In this case, we take
$p^\mu = (\omega, 0, 0, \omega)$ for our reference momentum. The little group for this momentum is generated
by the components of $W$:

$$\begin{aligned}
W^0 &= \omega L_3 \\
W^1 &= \omega (L_1 + K_2) \\
W^2 &= \omega (L_2 - K_1) \\
W^3 &= \omega L_3 .
\end{aligned} \tag{B.64}$$

These generators satisfy the commutation relations

$$\begin{aligned}
-i [W^1, W^3] &= -\omega W^2 \\
-i [W^2, W^3] &= \omega W^1
\end{aligned} \tag{B.65}$$

$$-i [W^1, W^2] = 0 , \tag{B.66}$$

so we can simultaneously diagonalize $W^1$ and $W^2$. However, using the commutation relations,
one can show that the existence of one state with finite eigenvalues $w_1$ and $w_2$ for $W^1$ and $W^2$
implies the existence of a continuous infinity of such states:

$$e^{i\theta L_3} |w_1, w_2\rangle = |w_1 \cos\theta + w_2 \sin\theta, \; w_2 \cos\theta - w_1 \sin\theta\rangle . \tag{B.67}$$

Since we observe no massless states with a continuous degree of freedom like this, we have
no choice but to conclude that $W^1$ and $W^2$ annihilate our states. In this subspace, $W^1$ and
$W^2$ commute with $W^3$ so we are free to diagonalize it. This leaves us with no generators to
connect different eigenstates of $W^3$. If $W^3$ has more than one eigenvalue, these states must
be in different irreducible representations of the Poincaré group since they are not linked by
any of its generators.$^{29}$

28 We also use the fact that $(L_\pm |\ell m\rangle)^\dagger = \langle \ell m | L_\mp$.

Massless states of 3-momentum $\mathbf{p}$ are characterized by the action of the helicity operator,
$\mathbf{p} \cdot \mathbf{L} / p^0$. The eigenvalue of this operator, $\lambda$, is called the helicity of the state. Written in this
way, the helicity of a system is invariant under the action of the Poincaré group. Although
the proper Lorentz group cannot connect massless states with different helicity, the action of
parity relates helicity $+\lambda$ to helicity $-\lambda$.$^{30}$ Topological considerations require$^{31}$

$$e^{4\pi i L_3} = e^{4\pi i \lambda} = 1 , \tag{B.68}$$

or $\lambda$ is either integer or half-integer. The fact that $\lambda$ is the eigenvalue of an angular momentum
operator tempts us to call it the spin projection of the system. However, since the square of
the Pauli-Ljubanskii vector is 0 regardless of the value of $\lambda$, we must be careful with such
terms.

Explicit forms for the quantum mechanical operators $P$ and $\mathcal{J}$ will in general depend on the
specific theory. These operators can be obtained from the lagrangian using general techniques
outlined in Section D.2.



B.3 The Spinor Representation of SO(3, 1)

In this section, we consider the smallest representation of SO(3, 1). From ordinary quantum
mechanics, we know that the group SO(3) is locally isomorphic to SU(2) (it satisfies the same
Lie algebra). This led us to the fact that the fundamental representation of SO(3) can be
realized in terms of quantum mechanical 2-spinors. Now, we are faced with the problem of
finding the fundamental representation of the somewhat larger group, SO(3, 1). A simple way
of going from a certain group to a larger one is to relax some of the constraints defining the
smaller group. Relaxing the determinant condition in SU(2) only yields a trivial phase which
commutes with everything. One look at Eq.(B.39) tells us that this is not good enough. Let
us see what happens if we relax the unitarity condition.

29 The other generators are of no use to us here since they will change the momentum of the state.

30 This is because parity changes the sign of $\mathbf{p}$ while leaving $\mathbf{L}$ invariant.

31 The Lorentz group is not simply connected, so the identification of a $2\pi$ rotation with the identity is
misleading. However, going around the group twice will always get us back to the same element. One way to
see this is through the rotation subgroup, SO(3). Rotation through an angle $\theta$ off the z-axis and $\phi$ around it
(which is represented $(\theta, \phi)$) is physically identical to $(2\pi - \theta, \pi + \phi)$. These are two distinct group elements
which are identified in this representation. This degeneracy is only twofold, so going around twice gets one
back to the same group element.

The group of 2 x 2 complex matrices with unit determinant is a representation of SL(2, C), 
the special linear group in two complex dimensions. A general element of SL(2, C) has four 
complex entries with one complex condition. Hence it is composed of six independent elements. 
This certainly is a good sign. Let us take £ a to be a two component object which transforms 
under SL(2, C) as 

C -» (T£) Q , (B.69) 

and r) a to be one which transforms as 

T} a -» (vT^a . (B.70) 

Obviously, because of the way we have defined £ and 77, rj a ^ a is invariant under SL{2, C). Our 
experience with 50(3, 1) has taught us that conditions on matrices lead to invariants of the 
group. There, we saw that the condition detA = +1 led to the invariance of the Levi-Cevita 
symbol. Here, we define e a /3 to be the totally antisymmetric tensor of SL(2,C). We use the 
convention e\2 = +1. Since e is invariant, the product e a p£, a, n is also invariant. This prompts 
us to identify 

fla = eap-q 13 , (B.71) 

making e the metric tensor in this space. Note that our metric is antisymmetric. This means 
that we must be extremely careful with the order of indices and the proper way to sum here. 
For example, e a/ 3r/ a = —r/p ^ r\p. We can define a raising operator such that 

e° , % 7 = ^ . (B.72) 

Note that this implies that e a @ = —e a p- 

The complex conjugate of £ Q transforms under SL(2, C) like T* 7^ T. Hence it forms a 
separate space of transformations. Denoting (£")* = £ Q , we can play the whole game over 
again. The index a is called a 'dotted' index and lives in a different space than a. Since the 
two indices transform in completely separate ways, there is no metric which connects them. 
As with any group representation, products of spinors give us higher representations. For 
example, the product 

$$\zeta^{\alpha\beta} = \xi^\alpha\eta^\beta \tag{B.73}$$

is a spinor tensor of rank two (or a bispinor). This representation is reducible, as we can see 
by separating it into two independent pieces: 

$$\zeta^{\alpha\beta} = \frac{1}{2}\left(\zeta^{\alpha\beta} + \zeta^{\beta\alpha}\right) + \frac{1}{2}\left(\zeta^{\alpha\beta} - \zeta^{\beta\alpha}\right) . \tag{B.74}$$

The second term is proportional to the metric $\epsilon^{\alpha\beta}$ since it is antisymmetric in $\alpha$ and $\beta$. 
Hence it is invariant under SL(2, C). The first term is symmetric in $\alpha$ and $\beta$. We denote 
this as $\zeta^{(\alpha\beta)}$. Since it is symmetric, it only has three independent components. Note that 
the total number of components of $\zeta^{\alpha\beta}$, 4, is conserved in the process of breaking it up into 
irreducible representations. This is a good check that we have done the separation correctly. 
This equation may be written as $2\times2 = 3 + 1$ since we began with the direct product 
of two spinor representations (2) and ended with the sum of a singlet (1) and a triplet (3) 
representation. 
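The split of a rank-two spinor tensor into a triplet and a singlet can be illustrated with a short numerical sketch. It assumes NumPy, and the variable names are illustrative only. The antisymmetric piece is a single number times the metric and is left untouched by a unit-determinant transformation, while the symmetric piece stays symmetric.

```python
import numpy as np

rng = np.random.default_rng(1)
eps = np.array([[0.0, 1.0], [-1.0, 0.0]])  # antisymmetric metric, eps_{12} = +1

# A generic rank-two spinor tensor zeta^{ab} with four complex components.
zeta = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))

symmetric = 0.5 * (zeta + zeta.T)      # triplet piece
antisymmetric = 0.5 * (zeta - zeta.T)  # singlet piece

# The antisymmetric piece is one complex number times the metric.
singlet = antisymmetric[0, 1]

# Transform both indices with the same unit-determinant matrix T.
m = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
T = m / np.sqrt(np.linalg.det(m))
sym_rotated = T @ symmetric @ T.T
antisym_rotated = T @ antisymmetric @ T.T

# Independent components of a symmetric / antisymmetric 2x2 array.
n_symmetric = 2 * (2 + 1) // 2
n_antisymmetric = 2 * (2 - 1) // 2
print(n_symmetric, n_antisymmetric)
```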

In general, a spinor tensor with $2s$ undotted indices may be completely symmetrized by a 
systematic removal of all of its 'traces' (contractions with the metric tensor). The remaining 
object transforms as spin $s$ and the traces we removed transform as spin $s-1, s-2, \ldots, 0$ or 
$1/2$, depending on whether $2s$ is even or odd. These statements can be used as a definition of 
'spin' if one so chooses. The motivation comes from the eigenstates of the angular momentum 
operator, which we have not yet discussed in this context. However, the spectrum of eigenvalues 
of $J^2$ and $J_z$ depends only on the commutation relations of the group SO(3, 1). Since these 
relations are the same as those of SL(2, C), as we will show below, the angular momentum 
operators in our spinor space will have the same spectrum we discussed above. An irreducible 
representation of SL(2, C) with three independent components, like $\zeta^{(\alpha\beta)}$, will have eigenvalues 
of $J_z = 0, \pm1$ and $J^2 = 2$. Thus it behaves like it has angular momentum, or spin, 1. 

Mixed combinations of spinors are another matter entirely. Since dotted and 
undotted indices live in two different spaces, it makes no sense to symmetrize or antisymmetrize 
them. The product of a spinor (2) and an antispinor ($\bar{2}$) representation is itself irreducible 
($2\times\bar{2} = 4$). A mixed bispinor will also be real in the sense that 

$$(\zeta^{\alpha\dot\beta})^* = \zeta^{\dot\alpha\beta} = \zeta^{\beta\dot\alpha} , \tag{B.75}$$

where the last equality comes from the fact that the order of dotted and undotted indices is 
unimportant; these indices are in two different spaces. Mixed bispinors form an irreducible 
representation of SL(2, C) with four real entries. If representations of SL(2, C) can indeed be 
identified with representations of SO(3, 1), then a relation between this irreducible 4-dimensional 
representation and the representation $x^\mu$ from Section B.1 is possible. 

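Writing $\zeta^{\alpha\dot\beta}$ as a $2\times2$ matrix, we see that the relation (B.75) translates into a hermiticity 
condition on $\zeta$. Any hermitian $2\times2$ matrix may be expanded in terms of the identity and 
the Pauli matrices, 

$$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} , \qquad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} , \qquad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} . \tag{B.76}$$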

Writing $\bar\sigma^\mu = (1, -\vec\sigma)$, we make the identification 

$$\zeta^{\alpha\dot\beta} = (x_\mu\bar\sigma^\mu)^{\alpha\dot\beta} . \tag{B.77}$$

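We would like for $\zeta$ to behave under SL(2, C) rotations in such a way that this relation is 
preserved. An infinitesimal SL(2, C) transformation has six independent parameters, and so 
can be expressed 

$$(T)^\alpha{}_\beta = \delta^\alpha_\beta - \frac{i}{2}\theta_{\mu\nu}(\Sigma^{\mu\nu})^\alpha{}_\beta , \tag{B.78}$$

where the $\Sigma$ are linearly independent $2\times2$ matrices. Note that we have emphasized the fact 
that $\alpha$ is the first index on $T$ and $\beta$ the second. The transformation for $\zeta$ is 

$$\begin{aligned} \zeta^{\alpha\dot\beta} &\to T^\alpha{}_\gamma\,(T^*)^{\dot\beta}{}_{\dot\delta}\,\zeta^{\gamma\dot\delta} \\ &= \zeta - \frac{i}{2}\theta_{\mu\nu}\left(\Sigma^{\mu\nu}\zeta - \zeta(\Sigma^{\mu\nu})^\dagger\right) , \end{aligned} \tag{B.79}$$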

where in the second line we have suppressed the indices. The transposition comes from the 
convention of matrix multiplication.$^{32}$ In another frame, the new $\zeta$ should be related to the 
new $x$ in the same way as the old ones for any choice of $x^\mu$. According to (B.77), (B.79), and 
(B.22), we require 

$$-i\Sigma^{\mu\nu}\bar\sigma^\lambda + i\bar\sigma^\lambda(\Sigma^{\mu\nu})^\dagger = g^{\nu\lambda}\bar\sigma^\mu - g^{\mu\lambda}\bar\sigma^\nu . \tag{B.80}$$

If we can solve this equation for $\Sigma$, we have found a covariant relationship between the 4 of 
SL(2, C) and the 4 of SO(3, 1). 

32 The transposition comes from the convention that undotted indices should label rows and the dotted ones 
columns. Of course, this convention is arbitrary, but one must be consistent with whatever convention is 
chosen. 

This equation is not as complicated as it seems; the easiest way to solve it is to recognize 
that each element is just a $2\times2$ matrix. Any $2\times2$ complex matrix can be written as a linear 
combination of the Pauli matrices and the identity, if we allow complex coefficients. Taking 
special values for $\mu$, $\nu$, and $\lambda$ then allows us to deduce the coefficients. Defining 

(o^^^r , (B.8i) 

we have 

• (B-82) 



It is easy to show that these objects satisfy the commutation relations of SO(3, 1), Eq. (B.39). 
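A numerical cross-check of this kind of statement can be sketched as follows. The explicit choices here are assumptions made for illustration, not formulas quoted from the text: $\sigma^\mu = (1, \vec\sigma)$, $\bar\sigma^\mu = (1, -\vec\sigma)$, the metric signature $(+,-,-,-)$, the candidate generator $\Sigma^{\mu\nu} = \frac{i}{4}(\bar\sigma^\mu\sigma^\nu - \bar\sigma^\nu\sigma^\mu)$, and one common sign convention for the Lorentz algebra, which may differ from Eq. (B.39) by convention. The function names are hypothetical.

```python
import itertools
import numpy as np

identity = np.eye(2, dtype=complex)
pauli = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
sigma = [identity] + pauli                    # assumed sigma^mu = (1, +sigma)
sigma_bar = [identity] + [-s for s in pauli]  # assumed sigma-bar^mu = (1, -sigma)
g = np.diag([1.0, -1.0, -1.0, -1.0])          # assumed metric signature (+,-,-,-)

def Sigma(mu, nu):
    """Assumed candidate generator (i/4)(sbar^mu s^nu - sbar^nu s^mu)."""
    return 0.25j * (sigma_bar[mu] @ sigma[nu] - sigma_bar[nu] @ sigma[mu])

def vector_condition_holds():
    """-i S sbar^lam + i sbar^lam S^dagger == g^{nu lam} sbar^mu - g^{mu lam} sbar^nu."""
    for mu, nu, lam in itertools.product(range(4), repeat=3):
        S = Sigma(mu, nu)
        lhs = -1j * S @ sigma_bar[lam] + 1j * sigma_bar[lam] @ S.conj().T
        rhs = g[nu, lam] * sigma_bar[mu] - g[mu, lam] * sigma_bar[nu]
        if not np.allclose(lhs, rhs):
            return False
    return True

def lorentz_algebra_holds():
    """[S^{mn}, S^{rs}] == i(g^{nr}S^{ms} - g^{mr}S^{ns} - g^{ns}S^{mr} + g^{ms}S^{nr})."""
    for m, n, r, s in itertools.product(range(4), repeat=4):
        lhs = Sigma(m, n) @ Sigma(r, s) - Sigma(r, s) @ Sigma(m, n)
        rhs = 1j * (g[n, r] * Sigma(m, s) - g[m, r] * Sigma(n, s)
                    - g[n, s] * Sigma(m, r) + g[m, s] * Sigma(n, r))
        if not np.allclose(lhs, rhs):
            return False
    return True

print(vector_condition_holds(), lorentz_algebra_holds())
```

Under these assumptions the rotation generators reduce to $\Sigma^{12} = \sigma^3/2$ and cyclic permutations, while the boost generators are anti-hermitian, which is why the dagger in the vector condition cannot be dropped.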

The relationship between vector indices and spinor indices allows us to understand the 
construction of SO(3, 1)'s irreducible tensor representations. According to Eq. (B.77), we have 
the correspondence 

for an arbitrary Lorentz tensor $T$. We can form irreducible representations of SL(2, C) by 
systematically removing the traces of $\tau$ with respect to like indices. Studying the symmetry 
of different representations under the interchange of two Lorentz indices, i.e. the simultaneous 
replacement $(\alpha_i\dot\beta_i, \alpha_j\dot\beta_j) \to (\alpha_j\dot\beta_j, \alpha_i\dot\beta_i)$, we see that they can be constructed to be either 
symmetric or antisymmetric.$^{33}$ 

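If antisymmetric, one can either have 

$$\epsilon^{\alpha_i\alpha_j}\,\zeta^{(\dot\beta_i\dot\beta_j)} \tag{B.84}$$

or 

$$\epsilon^{\dot\beta_i\dot\beta_j}\,\zeta^{(\alpha_i\alpha_j)} . \tag{B.85}$$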



Neither of these is hermitian, but the sum and the difference (divided by i) both are. These 
combinations will be related to irreducible tensor representations of the Lorentz group which 
are antisymmetric under the interchange of two indices. 

33 In reality, the construction is not quite this simple. The natural irreducible representations of SL(2, C) do 
not lend themselves to interpretation in SO(3, 1). However, certain linear combinations of them have symmetry 
properties which are readily understood within SO(3, 1). 

If symmetric, we have two possibilities. First, all four indices could be contained within 
metric tensors: 

$$\epsilon^{\alpha_i\alpha_j}\,\epsilon^{\dot\beta_i\dot\beta_j} . \tag{B.86}$$

Inspection of the relationship (B.77) between SO(3, 1) and SL(2, C) reveals that this object 
represents the metric tensor $g^{\mu\nu}$ of SO(3, 1). Hence this possibility leads to a trace in Lorentz 
space. Alternatively, the structure 

$$\zeta^{(\alpha_i\alpha_j)(\dot\beta_i\dot\beta_j)} \tag{B.87}$$

satisfies our requirement. This object is obviously related to a symmetric Lorentz tensor 
which does not have a trace. This simple argument can be used to construct the irreducible 
breakdowns of any tensor in SO(3, 1). In particular, it implies that the largest representation 
possible is the fully symmetric and traceless combination 

Since this tensor corresponds to a spinor tensor with $n$ symmetrized dotted indices and $n$ 
symmetrized undotted indices, its spin is $s = 2n/2 = n$. 

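To see how this works explicitly, let us study the decomposition of the tensor 

In order to avoid overcounting, we must proceed as systematically as possible. With this in 
mind, we leave the dotted indices alone for the moment and work only with the undotted. The 
possible structures are 

$$\tau_s = \zeta^{(\alpha_1\alpha_2\alpha_3)\dot\beta_1\dot\beta_2\dot\beta_3} \tag{B.90}$$

$$\tau_{t_2} = \epsilon^{\alpha_1\alpha_2}\,\zeta^{\alpha_3\dot\beta_1\dot\beta_2\dot\beta_3} \tag{B.91}$$

$$\tau_{t_3} = \epsilon^{\alpha_1\alpha_3}\,\zeta^{\alpha_2\dot\beta_1\dot\beta_2\dot\beta_3} . \tag{B.92}$$

Note that we did not take all three possible traces as they are not all independent. Traces in 
SL(2, C) are related to antisymmetric structures, so the symmetric combination 

$$\epsilon^{\alpha\beta}\zeta^\gamma + \epsilon^{\beta\gamma}\zeta^\alpha + \epsilon^{\gamma\alpha}\zeta^\beta = 0 \tag{B.93}$$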
is constrained to vanish. This can be seen by explicitly checking all three components. In 
general, the procedure is to choose one index and systematically remove all of the traces 
involving that index. Other traces are constrained by consistency relations like (B.93). The 
dotted indices decompose for each of the above structures in the same way. 
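The consistency relation between the three traces is an instance of the statement that no totally antisymmetric three-index object exists in two dimensions. A minimal numerical sketch, assuming NumPy and using illustrative variable names, builds the cyclic combination for a random spinor and inspects all eight components:

```python
import numpy as np

rng = np.random.default_rng(2)
eps = np.array([[0.0, 1.0], [-1.0, 0.0]])             # antisymmetric metric
zeta = rng.normal(size=2) + 1j * rng.normal(size=2)   # arbitrary one-index spinor

# cyclic[a, b, c] = eps^{ab} zeta^c + eps^{bc} zeta^a + eps^{ca} zeta^b
cyclic = (np.einsum("ab,c->abc", eps, zeta)
          + np.einsum("bc,a->abc", eps, zeta)
          + np.einsum("ca,b->abc", eps, zeta))

# Largest component in absolute value; the combination is antisymmetric
# under any exchange of two indices, each of which takes only two values.
print(np.abs(cyclic).max())
```

The overall sign convention chosen for `eps` does not matter here, since the combination is linear in the metric.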



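Equation (B.90) gives the irreducible representations 

$$\tau_{ss} = \zeta^{(\alpha_1\alpha_2\alpha_3)(\dot\beta_1\dot\beta_2\dot\beta_3)} \tag{B.94}$$

$$\tau_{st_2} = \epsilon^{\dot\beta_1\dot\beta_2}\,\zeta^{(\alpha_1\alpha_2\alpha_3)\dot\beta_3} \tag{B.95}$$

$$\tau_{st_3} = \epsilon^{\dot\beta_1\dot\beta_3}\,\zeta^{(\alpha_1\alpha_2\alpha_3)\dot\beta_2} . \tag{B.96}$$

The first of these is the fully symmetric spin-3 contribution. It corresponds to the fully 
symmetric and traceless 

$$T^{(\mu_1\mu_2\mu_3)} . \tag{B.97}$$

Since three fully symmetrized SL(2, C) indices have four components: all the same (2), one 
different (2), this representation has 4 · 4 = 16 components. 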



Turning to the trace terms, we see that neither (B.95) nor (B.96) is hermitian, so neither 
can directly be related to representations of SO(3, 1). However, their complex conjugates are 
contained in$^{34}$ $\tau_{t_2s}$ and $\tau_{t_3s}$. Representations of SO(3, 1) can only be formed with these 'mixed' 
spinor tensors via hermitian combinations. Since we cannot separate these from each other 
while at the same time retaining hermiticity and completeness of the representation, $\tau_{st_2}$ and 
$\tau_{t_2s}$ must combine to form one irreducible representation of SO(3, 1), 

$$T^{[\mu_1(\mu_2]\mu_3)} . \tag{B.98}$$

The fact that SO(3, 1) possesses these two invariant subgroups is already obvious from the 
behavior of its generators in Eqs. (B.46) and (B.47). The number of components the tensor in 
(B.95) contains is 4 · 2 = 8; (B.98) contains 16 components, as does its friend 

$$T^{[\mu_1(\mu_3]\mu_2)} . \tag{B.99}$$

34 My notation reads 'symmetric', 'trace with $\alpha_2$', and 'trace with $\alpha_3$'. The order refers to undotted versus 
dotted indices. 

Note that in order to actually form representations with the indicated symmetry structure, we 
would need to involve the tensor 

$$\epsilon^{\alpha_2\alpha_3}\,\zeta^{\alpha_1(\dot\beta_1\dot\beta_2\dot\beta_3)} \tag{B.100}$$

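and its hermitian conjugate. With these six tensors, we can form three hermitian Lorentz 
structures. However, in light of the above discussion, only two of them are independent. 
We are left with 

$$\tau_{t_2t_3} = \epsilon^{\alpha_1\alpha_2}\,\epsilon^{\dot\beta_1\dot\beta_3}\,\zeta^{\alpha_3\dot\beta_2} \tag{B.101}$$

$$\tau_{t_3t_2} = \epsilon^{\alpha_1\alpha_3}\,\epsilon^{\dot\beta_1\dot\beta_2}\,\zeta^{\alpha_2\dot\beta_3} \tag{B.102}$$

$$\tau_{t_2t_2} = \epsilon^{\alpha_1\alpha_2}\,\epsilon^{\dot\beta_1\dot\beta_2}\,\zeta^{\alpha_3\dot\beta_3} \tag{B.103}$$

$$\tau_{t_3t_3} = \epsilon^{\alpha_1\alpha_3}\,\epsilon^{\dot\beta_1\dot\beta_3}\,\zeta^{\alpha_2\dot\beta_2} . \tag{B.104}$$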

The first two of these expressions must be combined to form a representation of SO(3, 1), as 
before. In this case, the presence of two metric tensors allows us to construct separate hermitian 
representations. Involving the other four mixed-symmetric two-trace tensor structures allows 
us to construct a fully symmetrical tensor which can be identified as the trace we removed 
from (B.97) and a fully antisymmetric tensor which is identified with 

$$T^{[\mu_1\mu_2\mu_3]} . \tag{B.105}$$

Both of these tensors have 4 components. The remaining two structures, $\tau_{t_2t_2}$ and $\tau_{t_3t_3}$, are 
readily identified in SO(3, 1) as the other two traces: 

$$\tau_{t_2t_2} \sim g^{\mu_1\mu_2}P^{\mu_3} \tag{B.106}$$

$$\tau_{t_3t_3} \sim g^{\mu_1\mu_3}P^{\mu_2} . \tag{B.107}$$

Hence in SL(2, C) we have the decomposition 

2x2x2x2x2x2 = 16+(8 + 8)+(8 + 8)+(4 + 4)+4 + 4 , (B.108) 

while in SO(3, 1), 

$$4\times4\times4 = 16_S + 16_M + 16_{M'} + 4_S + 4_M + 4_{M'} + 4_A \tag{B.109}$$

is induced. Note that in SO(3, 1) all three possible traces show up in the final decomposition, 
the third being formed from the combination of SL(2, C)'s 4 + 4 which is orthogonal to the 
fully antisymmetric tensor. This is because traces in SO(3, 1) are unconstrained. 
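The component counting behind this decomposition can be reproduced with a few lines of arithmetic. The sketch below is illustrative only (the function `dimension` and the list `like_index_pieces` are hypothetical names). It uses the fact that a tensor with $n$ symmetrized undotted and $m$ symmetrized dotted indices has $(n+1)(m+1)$ components, and that three like indices reduce to one fully symmetric piece plus two independent traces.

```python
from collections import Counter
from itertools import product

def dimension(n_undotted, n_dotted):
    """Components of a tensor with symmetrized undotted and dotted index groups."""
    return (n_undotted + 1) * (n_dotted + 1)

# Free symmetrized indices left in each piece of a three-index product:
# the fully symmetric piece keeps 3, each of the two independent traces keeps 1.
like_index_pieces = [3, 1, 1]

# Every undotted piece is paired with every dotted piece.
pieces = [dimension(n, m) for n, m in product(like_index_pieces, repeat=2)]
multiplicities = Counter(pieces)
total = sum(pieces)

print(sorted(multiplicities.items(), reverse=True), total)
```

The total must match both $2^6$ on the SL(2, C) side and $4^3$ on the SO(3, 1) side, which is the check the text recommends for any such separation.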

35 under the exchange of pairs of indices, i.e. $(\alpha_i\dot\beta_i, \alpha_j\dot\beta_j) \to (\alpha_j\dot\beta_j, \alpha_i\dot\beta_i)$. 
This procedure can be extended to tensors of higher rank without incident. While somewhat 
tedious, it certainly beats counting components and enforcing constraint relations to avoid 
overcounting. An equivalent procedure one can use to directly obtain the representations of 
SO(3, 1) is offered by Young tableaux. The most useful reference that I have come across 
on this method is [90]. 

At some point, we would like to use all of this formalism to do physics. Consider a classical 
spinor field, $\xi^\alpha(x)$. Following the discussion of the last section on irreducible representations 
of the Poincaré group, we wish to diagonalize the momentum operator $P^\mu$. This operator has 
two spinor representations, $(P\cdot\sigma)_{\dot\alpha\beta}$ and $(P\cdot\bar\sigma)^{\alpha\dot\beta}$. Each representation necessarily has one 
dotted and one undotted index in SL(2, C), so the object $P\xi$ cannot transform in the same 
way as $\xi$ itself. This leaves us with two options: we can either require $P$ to have eigenvalue 
zero or allow it to change our spinor into another kind of spinor. The former option leads to 
the massless Weyl spinors that describe neutrinos, and the latter leads to the massive Dirac 
spinors appropriate for electrons. 

Weyl spinors are obtained simply from the requirement 

$$(P\cdot\sigma)_{\dot\alpha\beta}\,\xi^\beta = 0 . \tag{B.110}$$

Contracting with $(P\cdot\bar\sigma)^{\gamma\dot\alpha}$ and using the identity 

$$(\bar\sigma^\mu\sigma^\nu)^\alpha{}_\beta = g^{\mu\nu}\delta^\alpha_\beta - 2i(\Sigma^{\mu\nu})^\alpha{}_\beta , \tag{B.111}$$

it is easily seen that the Casimir $P^2$ has eigenvalue zero so these spinors can only represent 
massless states. Within the proper Lorentz group, this spinor stands alone; it forms its own 
irreducible representation. However, under parity our spinor field transforms as$^{36}$ 

$$\mathcal{P}\xi^\alpha = \eta_{\dot\alpha} . \tag{B.112}$$

Hence if we wish to consider representations of the full Lorentz group, we must include the 
field $\eta_{\dot\alpha}$ with $\xi^\alpha$ in our massless representation.$^{37}$ This field satisfies 

$$(P\cdot\bar\sigma)^{\alpha\dot\beta}\,\eta_{\dot\beta} = 0 . \tag{B.113}$$

Since we wish to consider these fields together, we write 

$$\psi = \begin{pmatrix} \eta_{\dot\alpha} \\ \xi^\alpha \end{pmatrix} , \tag{B.114}$$

for which the statement of translation invariance becomes 

$$P_\mu \begin{pmatrix} 0 & \sigma^\mu \\ \bar\sigma^\mu & 0 \end{pmatrix} \psi = 0 . \tag{B.115}$$

36 This can be seen in the following way: $\mathcal{P}\xi^\alpha$ cannot transform like $\xi^\alpha$ because $\mathcal{P}$ does not commute with all 
Lorentz transformations. However, since the spatial part of $\mathcal{P}$ is proportional to the identity, it will commute 
with spatial rotations. For spatial rotations, our SL(2, C) matrix is unitary so $(T^{-1})^\dagger = T$. This implies that 
$\mathcal{P}\xi^\alpha$ transforms like $\eta_{\dot\alpha}$. 



The 4-component spinor $\psi$ is a very strange object. It mixes dotted and undotted indices 
in a way we have not yet encountered. This mixing has been forced upon us by parity. Systems 
in which parity is not conserved, like the weak interactions, do not require us to consider both 
spinors together. In fact, it has long been thought that the analogue of $\xi^\alpha$ for neutrinos simply 
does not exist. Here we are concerned with the strong and electromagnetic interactions, both 
of which conserve parity, so the existence of one helicity necessarily implies the existence of 
the other. The $4\times4$ matrices in the parentheses come up over and over again. We will call 
them $\gamma^\mu$. We also introduce $\slashed{P}$ as shorthand for $P_\mu\gamma^\mu$. Using this compact notation, we write 
the 'equation of motion' for massless spinor fields as 

$$\slashed{P}\psi = 0 . \tag{B.116}$$

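37 This is a specific example of the general argument given in the last section that parity relates the two 
helicities $\pm\lambda$ to each other. We will see below that $\xi^\alpha$ corresponds to massless states with helicity +1/2 while 
$\eta_{\dot\alpha}$ describes states with helicity −1/2. 

For a massless spinor moving in the 3-direction, our equation of motion reduces to 

$$2p^0 \begin{pmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} \psi = 0 , \tag{B.117}$$

which has two linearly independent solutions, 

$$u(p,+) = \sqrt{2p^0} \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix} \qquad \text{and} \qquad u(p,-) = \sqrt{2p^0} \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} , \tag{B.118}$$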
which we normalize in such a way that the following relations hold 

$$u^\dagger(p,+)u(p,+) = u^\dagger(p,-)u(p,-) = 2p^0 \tag{B.119}$$

$$\bar u(p,+)u(p,+) = \bar u(p,-)u(p,-) = 0 \tag{B.120}$$

$$\bar u(p,+)u(p,-) = \bar u(p,-)u(p,+) = 0 \tag{B.121}$$

$$\sum_h u(p,h)\bar u(p,h) = \slashed{p} , \tag{B.122}$$

where I have defined $\bar u = u^\dagger\gamma^0$. Note that (B.119) is not Lorentz invariant since $\psi^\dagger\psi$ involves 
a sum over dotted and undotted indices. However, 

$$\bar\psi\psi = \eta_\alpha\xi^\alpha + \xi^{\dot\alpha}\eta_{\dot\alpha} \tag{B.123}$$

is invariant since it involves contractions with like indices only. (B.120) is simply the statement 
that for massless spinors the upper and lower components are uncoupled. This is not the case 
for massive spinors, where the momentum operator itself couples dotted and undotted indices. 

Since $\bar\psi\psi$ is invariant and $\gamma^\mu\psi$ transforms in SL(2, C) like $\psi$ itself, the combination $\bar\psi\gamma^\mu\psi$ 
transforms under the action of the Lorentz group as a vector (it is invariant under SL(2, C)). 
This means that $\psi^\dagger\psi = \bar\psi\gamma^0\psi$ is just the time component of a 4-vector. The last relation is a 
statement about the completeness of our solutions. It is covariant in the sense that both sides 
transform like the SL(2, C) indices on a $\gamma$-matrix. 
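The normalization and completeness relations (B.119) to (B.122) can be checked numerically for a specific momentum. The sketch below makes several assumptions that should be compared against the conventions actually used in the text: a chiral (Weyl) basis with $\gamma^\mu$ built from off-diagonal blocks $\sigma^\mu = (1, \vec\sigma)$ and $\bar\sigma^\mu = (1, -\vec\sigma)$, a metric of signature $(+,-,-,-)$, and solutions whose only nonzero entry sits in the third or second component. The helper names `slash` and `bar` are illustrative.

```python
import numpy as np

identity = np.eye(2, dtype=complex)
zero = np.zeros((2, 2), dtype=complex)
pauli = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
sigma = [identity] + pauli
sigma_bar = [identity] + [-s for s in pauli]
# Assumed chiral basis: gamma^mu = [[0, sigma^mu], [sigma-bar^mu, 0]].
gamma = [np.block([[zero, sigma[mu]], [sigma_bar[mu], zero]]) for mu in range(4)]
metric = np.diag([1.0, -1.0, -1.0, -1.0])

def slash(p):
    """p-slash = p_mu gamma^mu for contravariant components p^mu."""
    p_lower = metric @ p
    return sum(p_lower[mu] * gamma[mu] for mu in range(4))

def bar(u):
    """Dirac adjoint u-bar = u^dagger gamma^0, returned as a row vector."""
    return u.conj() @ gamma[0]

energy = 3.0
p = np.array([energy, 0.0, 0.0, energy])  # massless, moving along the 3-direction
u_plus = np.sqrt(2 * energy) * np.array([0, 0, 1, 0], dtype=complex)
u_minus = np.sqrt(2 * energy) * np.array([0, 1, 0, 0], dtype=complex)

# Completeness sum over the two helicities.
completeness = np.outer(u_plus, bar(u_plus)) + np.outer(u_minus, bar(u_minus))
print(np.allclose(completeness, slash(p)))
```

With a different basis or index ordering the explicit column vectors change, but the four relations themselves are basis independent.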

Under Lorentz transformations, the lower components of $\psi$ transform according to (B.82). 
The upper components transform like 

(S^)/ = {ffV t) / = K)/ = \ (a»a» a^)l , (B.124) 

so $\psi$ transforms according to$^{38}$ 



'a^ 



i 1 



S^= = -[y, = -a^ . (B.125) 







4 2 



Although this is not a hermitian operator and certainly cannot be identified with $J$,$^{39}$ it does 
describe the action of the Lorentz group on classical fields $\psi$. For a classical massless particle 
moving in the 3-direction, the relevant helicity operator is 

$$\frac{1}{2} \begin{pmatrix} \sigma^3 & 0 \\ 0 & \sigma^3 \end{pmatrix} . \tag{B.126}$$

The action of this operator on our $u$'s tells us that $u(p,+)$ represents a particle with helicity 
+1/2 and $u(p,-)$ represents a particle with helicity −1/2. In this sense, massless spinors can 
be said to have spin-1/2. 

38 Since the ($2\times2$) $\sigma^{\mu\nu}$ has dotted and undotted SL(2, C) indices and this new ($4\times4$) $\sigma^{\mu\nu}$ has only the mixed 
'Dirac' indices, confusion should be rare. Throughout the text, the ($2\times2$) $\sigma^{\mu\nu}$ does not make an appearance. 

39 We will see how this object is related to $J$ in Section D.2. 

Dirac spinors are obtained by writing 

$$(P\cdot\sigma)_{\dot\alpha\beta}\,\xi^\beta = m\,\eta_{\dot\alpha} , \tag{B.127}$$

with $m$ some real constant. Since $P^2$ is a Casimir of the Poincaré group, consistency requires 
the relation$^{40}$ 

$$(P\cdot\bar\sigma)^{\alpha\dot\beta}\,\eta_{\dot\beta} = m\,\xi^\alpha . \tag{B.128}$$

Once again, we combine the two spinor fields into one 4-component spinor. Using the same 
notation, we write the equation of motion for massive spinors as 

$$(\slashed{P} - m)\psi = 0 . \tag{B.129}$$

40 We could have a different eigenvalue. In that case, the mass of the particle we are describing would be 
$\sqrt{mm'}$. However, we can always re-scale the fields to obtain the same eigenvalue. 

For a spinor at rest, this equation reduces to 

$$\begin{pmatrix} -m & 0 & m & 0 \\ 0 & -m & 0 & m \\ m & 0 & -m & 0 \\ 0 & m & 0 & -m \end{pmatrix} \psi = 0 . \tag{B.130}$$

Once again, we have two linearly independent solutions 

$$u(m,\uparrow) = \sqrt{m} \begin{pmatrix} 1 \\ 0 \\ 1 \\ 0 \end{pmatrix} \qquad \text{and} \qquad u(m,\downarrow) = \sqrt{m} \begin{pmatrix} 0 \\ 1 \\ 0 \\ 1 \end{pmatrix} . \tag{B.131}$$

The analogues of (B.119), (B.120), (B.121), and (B.122) are 

$$u^\dagger(p,\uparrow)u(p,\uparrow) = u^\dagger(p,\downarrow)u(p,\downarrow) = 2p^0 \tag{B.132}$$

$$\bar u(p,\uparrow)u(p,\uparrow) = \bar u(p,\downarrow)u(p,\downarrow) = 2m \tag{B.133}$$

$$\bar u(p,\uparrow)u(p,\downarrow) = \bar u(p,\downarrow)u(p,\uparrow) = 0 \tag{B.134}$$

$$\sum_s u(p,s)\bar u(p,s) = \slashed{p} + m . \tag{B.135}$$

Here, we have used the fact that (B.132) is actually the time component of a vector and (B.133) 
is invariant under the Lorentz group to write the appropriate expressions on the right-hand 
sides of these relations. Substitution of our expressions into (B.135) actually gives $m\gamma^0 + m\mathbf{1}$, 
where $\mathbf{1}$ is the identity in Dirac space. Since the left-hand side has the SL(2, C) indices of a 
$\gamma$-matrix as before, we deduce that the correct extension of $m\gamma^0 + m$ to other frames is $\slashed{p} + m$. 
This can be checked by transforming the solutions to other frames and explicitly calculating 
the completeness sum. 

Under Lorentz transformations, massive spinors transform via the same generators as massless 
spinors. The spin vector introduced in Section B.2 is given by the spatial components of 
$S$. Application of these matrices reveals that $u(m,\uparrow)$ and $u(m,\downarrow)$ represent particles with 
spin-1/2 up and down along the z-axis, respectively. Spinors representing particles with spin 
along other axes can be written as linear combinations of these solutions. 

The $\gamma$-matrices have many nice properties which make them easy to work with. The 
anticommutation relation, 

$$\{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu} , \tag{B.136}$$

can easily be proven using (B.111). This relation tells us that to span the space of possible 
Dirac matrices we need only consider antisymmetric products. Our explicit expression for the 
antisymmetric combination of two $\gamma$-matrices, $\sigma^{\mu\nu}$, tells us that this object represents six Dirac 
matrices that are linearly independent of each other. Since there are only four gamma matrices, 
the antisymmetric combination of three will have one missing. Multiplying by the square of 
the missing matrix (which is proportional to the identity), we see that this combination can 
be written $\gamma^\mu\gamma^0\gamma^1\gamma^2\gamma^3$. The antisymmetric product of four matrices must include all of them. 
Defining 

$$\gamma_5 = i\gamma^0\gamma^1\gamma^2\gamma^3 = -\frac{i}{4!}\epsilon^{\mu\nu\alpha\beta}\gamma_\mu\gamma_\nu\gamma_\alpha\gamma_\beta , \tag{B.137}$$

we see that the space of Dirac matrices is spanned by the linearly independent set 

$$\begin{array}{ll} 1 & \text{Scalar} \\ \gamma^\mu & \text{Vector} \\ \sigma^{\mu\nu} & \text{Tensor} \\ \gamma^\mu\gamma_5 & \text{Pseudovector} \\ \gamma_5 & \text{Pseudoscalar} . \end{array} \tag{B.138}$$

After each matrix, I have noted the Lorentz transformation properties of the bilinear $\bar\psi\Gamma\psi$. 
The prefix 'pseudo' refers to the transformation properties under parity. The presence of $\epsilon$ in 
the definition of $\gamma_5$ means that it has an extra sign change under parity. The properties 

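$$(\gamma^\mu)^\dagger = \gamma^0\gamma^\mu\gamma^0 \tag{B.139}$$

$$(\sigma^{\mu\nu})^\dagger = \gamma^0\sigma^{\mu\nu}\gamma^0 \tag{B.140}$$

$$(\gamma_5)^\dagger = \gamma_5 \tag{B.141}$$

$$\{\gamma^\mu, \gamma_5\} = 0 \tag{B.142}$$

imply that all these bilinears are real except the pseudoscalar, which is purely imaginary. More 
relations between the $\gamma$-matrices can be found in Appendix A.1. 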
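These algebraic properties are straightforward to verify numerically. The sketch below assumes NumPy, a chiral basis with off-diagonal blocks $\sigma^\mu = (1, \vec\sigma)$ and $\bar\sigma^\mu = (1, -\vec\sigma)$, the definition $\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu]$, and a metric of signature $(+,-,-,-)$; these are illustrative choices, and the helper names are hypothetical. It also counts the rank of the sixteen candidate basis matrices.

```python
import itertools
import numpy as np

identity = np.eye(2, dtype=complex)
zero = np.zeros((2, 2), dtype=complex)
pauli = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
sigma = [identity] + pauli
sigma_bar = [identity] + [-s for s in pauli]
# Assumed chiral basis for the gamma matrices.
gamma = [np.block([[zero, sigma[mu]], [sigma_bar[mu], zero]]) for mu in range(4)]
g = np.diag([1.0, -1.0, -1.0, -1.0])
gamma5 = 1j * gamma[0] @ gamma[1] @ gamma[2] @ gamma[3]

def sigma_munu(mu, nu):
    """Assumed definition sigma^{mu nu} = (i/2)[gamma^mu, gamma^nu]."""
    return 0.5j * (gamma[mu] @ gamma[nu] - gamma[nu] @ gamma[mu])

def dagger(m):
    """Hermitian conjugate."""
    return m.conj().T

# Scalar, vector, tensor, pseudovector, pseudoscalar: 1 + 4 + 6 + 4 + 1 matrices.
basis = [np.eye(4, dtype=complex)]
basis += gamma
basis += [sigma_munu(mu, nu) for mu, nu in itertools.combinations(range(4), 2)]
basis += [gm @ gamma5 for gm in gamma]
basis += [gamma5]

# Flatten each 4x4 matrix to a 16-vector and count independent directions.
rank = np.linalg.matrix_rank(np.array([b.reshape(-1) for b in basis]))
print(len(basis), rank)
```

A rank equal to the number of matrices confirms that the set is linearly independent and therefore spans the full space of $4\times4$ matrices.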

The matrix $\gamma_5$ is very useful in forming spin projection operators. The fact that $\gamma_5^2 = 1$ 
along with its explicit form in the Weyl basis, 

$$\gamma_5 = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} , \tag{B.143}$$

imply that the operators 

$$P_R = \frac{1}{2}(1 + \gamma_5) \qquad P_L = \frac{1}{2}(1 - \gamma_5) \tag{B.144}$$

project out the undotted and dotted, or positive and negative helicity states, respectively. 
Furthermore, it can easily be checked that the bilinear 

$$\bar u(p,s)\gamma^\mu\gamma_5 u(p,s) = 4ms^\mu \tag{B.145}$$

returns the spin vector for massive particles. This formula is also valid for massless fermions, 
provided the replacement $ms^\mu \to hp^\mu$ is understood. For massless particles, the projection 
operators are extremely useful. According to (B.122), the bilinear 

$$u(p,h)\bar u(p,h) = \frac{1}{2}(1 + 2h\gamma_5)\slashed{p} . \tag{B.146}$$

Here, $h$ is the true helicity of the fermion, taking the values ±1/2. 

Up until this point, we have focused on positive energy solutions, i.e. $p^0 > 0$. However, 
for reasons which will become clear in Appendix D.1, we must also consider the case in which 
$p^0 < 0$. Here, we define $k^\mu = -p^\mu$ and write 

$$\slashed{P}v(k,h) = \slashed{p}v(k,h) = -\slashed{k}v(k,h) = 0 \tag{B.147}$$

for massless fermions and 

$$\slashed{P}v(k,s) = \slashed{p}v(k,s) = -\slashed{k}v(k,s) = m\,v(k,s) \tag{B.148}$$

for massive. From Equation (B.147), it is clear that the solutions $v(k,h)$ for massless fermions 
correspond exactly to the positive energy solutions: 



v(f,h) = u(-p,-h) 



(B.149) 



Calling the objects represented by $u$ fermions and those represented by $v$ antifermions, this 
relation implies that an antifermion with positive helicity moving along the positive z-axis is 
equivalent to a fermion with negative helicity moving along the negative z-axis. 

For massive fermions, the correspondence is not so simple. In this case, the Dirac equation 
becomes 

$$\begin{pmatrix} m & 0 & m & 0 \\ 0 & m & 0 & m \\ m & 0 & m & 0 \\ 0 & m & 0 & m \end{pmatrix} v(k,s) = 0 \tag{B.150}$$

in the antifermion rest frame. Once again, we have two linearly independent solutions which 
we can take as 

$$v(k,\downarrow) = \sqrt{m} \begin{pmatrix} 1 \\ 0 \\ -1 \\ 0 \end{pmatrix} \qquad \text{and} \qquad v(k,\uparrow) = \sqrt{m} \begin{pmatrix} 0 \\ 1 \\ 0 \\ -1 \end{pmatrix} . \tag{B.151}$$

These solutions also refer to a spin-1/2 object, but the spin-up and spin-down solutions have 
changed roles owing to an extra sign in the Pauli-Ljubanskii vector. Here, we have the relations 

$$v^\dagger(k,\uparrow)v(k,\uparrow) = v^\dagger(k,\downarrow)v(k,\downarrow) = 2k^0 \tag{B.152}$$
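$$\bar v(k,\uparrow)v(k,\uparrow) = \bar v(k,\downarrow)v(k,\downarrow) = -2m \tag{B.153}$$

$$\bar v(k,\uparrow)v(k,\downarrow) = \bar v(k,\downarrow)v(k,\uparrow) = 0 \tag{B.154}$$

$$\bar v(k,s)u(p,s') = \bar u(p,s)v(k,s') = 0 \tag{B.155}$$

$$\sum_s v(k,s)\bar v(k,s) = -(\slashed{p} + m) = \slashed{k} - m . \tag{B.156}$$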

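Explicit forms can be found for both $u(p,s)$ and $v(p,s)$ by boosting the rest frame solutions 
via (B.125). The result is 

$$u(p,s) = \begin{pmatrix} \sqrt{p\cdot\sigma}\,\xi_s \\ \sqrt{p\cdot\bar\sigma}\,\xi_s \end{pmatrix} \qquad \text{and} \qquad v(k,s) = \begin{pmatrix} \sqrt{k\cdot\sigma}\,\xi_{-s} \\ -\sqrt{k\cdot\bar\sigma}\,\xi_{-s} \end{pmatrix} , \tag{B.157}$$

where $\xi_s$ is a 2-component spinor aligned with the direction $s$, 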

. (B.158) 

These solutions are constructed so that in the rest frame $s\cdot W$ is diagonal. The square root of 
a matrix is understood as an instruction to take the positive square root of each eigenvalue. 
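The prescription of taking the positive square root of each eigenvalue translates directly into code. The following sketch assumes NumPy, a chiral basis with $\sigma^\mu = (1, \vec\sigma)$ and $\bar\sigma^\mu = (1, -\vec\sigma)$ in the off-diagonal blocks, and spinors of the form (upper, lower) = ($\sqrt{p\cdot\sigma}\,\xi$, $\pm\sqrt{p\cdot\bar\sigma}\,\xi$); these conventions and all function names are assumptions for illustration. It builds the solutions for a generic momentum so that the Dirac equation, the normalizations and the completeness sums can be tested.

```python
import numpy as np

identity = np.eye(2, dtype=complex)
zero = np.zeros((2, 2), dtype=complex)
pauli = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
sigma = [identity] + pauli
sigma_bar = [identity] + [-s for s in pauli]
gamma = [np.block([[zero, sigma[mu]], [sigma_bar[mu], zero]]) for mu in range(4)]
metric = np.diag([1.0, -1.0, -1.0, -1.0])

def contract(p, matrices):
    """p_mu M^mu for contravariant components p^mu."""
    p_lower = metric @ p
    return sum(p_lower[mu] * matrices[mu] for mu in range(4))

def sqrt_hermitian(h):
    """Matrix square root: positive root of each eigenvalue of a hermitian matrix."""
    w, v = np.linalg.eigh(h)
    return (v * np.sqrt(w)) @ v.conj().T

def bar(u):
    """Dirac adjoint u^dagger gamma^0 as a row vector."""
    return u.conj() @ gamma[0]

def u_spinor(p, xi):
    return np.concatenate([sqrt_hermitian(contract(p, sigma)) @ xi,
                           sqrt_hermitian(contract(p, sigma_bar)) @ xi])

def v_spinor(k, chi):
    return np.concatenate([sqrt_hermitian(contract(k, sigma)) @ chi,
                           -sqrt_hermitian(contract(k, sigma_bar)) @ chi])

mass = 1.5
three_momentum = np.array([0.3, -0.4, 1.2])
energy = np.sqrt(mass**2 + three_momentum @ three_momentum)
p = np.concatenate([[energy], three_momentum])

two_spinors = [np.array([1, 0], dtype=complex), np.array([0, 1], dtype=complex)]
u_solutions = [u_spinor(p, xi) for xi in two_spinors]
v_solutions = [v_spinor(p, chi) for chi in two_spinors]

slash_p = contract(p, gamma)
u_sum = sum(np.outer(u, bar(u)) for u in u_solutions)
v_sum = sum(np.outer(v, bar(v)) for v in v_solutions)
print(np.allclose(u_sum, slash_p + mass * np.eye(4)))
```

The construction works because $\sqrt{p\cdot\sigma}$ and $\sqrt{p\cdot\bar\sigma}$ commute and multiply to $m$ for an on-shell momentum.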
Appendix C 

SU(N) 

The special unitary group in N dimensions, SU(N), has many applications throughout physics. 
For N = 2, we have a group structure which is homomorphic to the rotation group, SO(3). 
Its representations house non-relativistic fields with intrinsic angular momentum, or spin, as 
well as the orbital excitations of molecules. Taking N as the number of quark flavors allows 
us to represent the approximate flavor invariance of the QCD lagrangian. This symmetry 
was the first true indication that hadrons are composite. SU($2n_f$) describes the approximate 
spin-flavor symmetry of a theory with $n_f$ extremely massive quarks. Most recently, SU(2) 
and SU(3) along with the sister group U(1) have been useful in describing the full symmetry 
group of the Standard Model. Although this thesis is most concerned with the group SU(3) 
and its role in strong interaction physics, we will find it useful to familiarize ourselves with the 
general structure of this family of groups. 



C.1 General Properties 

In this section, we will discuss the definition and some of the mathematical properties of 
SU(N). Since we are concerned mainly with the fundamental representation of SU(N), we 
begin with a discussion in this context. The fundamental representation of SU(N) can be 
realized by the group of unitary $N\times N$ matrices with determinant +1. Suppose the vector $q^i$, 
$i = 1$ to $N$, transforms in the fundamental representation of SU(N). Then 

$$q^i \to (Uq)^i . \tag{C.1}$$
This implies that the complex conjugate transforms as 

$$(q^i)^* \to \left((Uq)^i\right)^* = \left(U^i{}_j q^j\right)^* = (q^j)^*(U^\dagger)^j{}_i = (q^*U^\dagger)_i . \tag{C.2}$$

Since $U$ is unitary, the product $q^*q$ is invariant under the transformation. In analogy with 
the Lorentz group, let us denote $(q^i)^* = q_i$. Objects which transform as $U$ are said to be in 
the $N$ of SU(N), while their conjugates are in the $\bar N$. Since $q^*q = q_iq^i = q_iq^j\delta^i_j$ is invariant, 
we have an invariant Kronecker delta. The invariance of this object is directly related to the 
unitarity of the elements of SU(N), which prompts us to look for an invariant related to the 
determinant of $U$. Just as in the case with the Lorentz group, the totally antisymmetric tensor 
$\epsilon^{i_1\cdots i_N}$ does the trick. These are the only two invariants in the fundamental representation of 
SU(N). 

Now that we have discussed the objects these transformations act on, let us parameterize 
a general element of the group in this representation. Any unitary matrix can be written 

$$U = e^{iH} , \tag{C.3}$$

where $H$ is a hermitian matrix. Using the matrix identity$^{1}$ $\log\det U = \mathrm{Tr}\log U$, we find that 
$\det U = 1 \Rightarrow \mathrm{Tr}\,H = 2n\pi$, with $n \in Z$. Any $N\times N$ matrix can be decomposed into a traceless 
part and a trace: 

$$H = \left(H - \frac{1}{N}\mathrm{Tr}\,H\right) + \frac{1}{N}\mathrm{Tr}\,H . \tag{C.6}$$

Hence our unitary matrix $U$ is the product of the exponential of a traceless matrix and a 
phase, $e^{2\pi in/N}$. Since this phase can only attain certain discrete values, one cannot move 
continuously from one value of $n$ to another. Furthermore, only the set with $n = 0$ is a 
subgroup since it alone contains its inverses and the identity. I will call this subgroup the 
proper SU(N) transformations and subsequently omit the 'proper'. The whole group can be 
considered (when it is absolutely necessary to do so) as the product of this SU(N) with the 
discrete U(1) group $e^{2\pi in/N}$, $n \in Z$. Some of the consequences of this phase to the global 
properties of the SM are discussed in [91]. 

1 This identity can be proven for any matrix which is the exponential of a complex multiple of a hermitian 
matrix as follows. Given 

$$M = e^{aH} ; \qquad U^\dagger HU = H_D , \tag{C.4}$$

with $H_D$ diagonal and $U$ unitary, we see immediately that 

$$\det(U^\dagger MU) = \det M = \det e^{aH_D} = e^{a\mathrm{Tr}H_D} = e^{a\mathrm{Tr}H} = e^{\mathrm{Tr}\log M} . \tag{C.5}$$

The validity of this identity rests mainly on two facts: that hermitian matrices can be diagonalized by a 
unitary transformation and that the relation is trivially true for diagonal matrices. 
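The determinant identity used above can be illustrated numerically. The sketch below assumes NumPy and exponentiates a hermitian matrix through its unitary diagonalization, which is the same route the proof takes; the function name `exp_of_hermitian` and the sample values are illustrative. It also shows that exponentiating a traceless hermitian matrix yields a unitary matrix of unit determinant.

```python
import numpy as np

def exp_of_hermitian(h, a):
    """exp(a * h) for hermitian h and complex a, via unitary diagonalization."""
    w, v = np.linalg.eigh(h)
    return (v * np.exp(a * w)) @ v.conj().T

rng = np.random.default_rng(3)
n = 3
m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
H = 0.5 * (m + m.conj().T)          # random hermitian matrix
a = 0.7 - 0.2j                      # arbitrary complex multiple

M = exp_of_hermitian(H, a)
lhs = np.linalg.det(M)              # det exp(aH)
rhs = np.exp(a * np.trace(H))       # exp(a Tr H)

# Removing the trace and exponentiating with a = i gives a special unitary matrix.
H_traceless = H - np.trace(H) / n * np.eye(n)
U = exp_of_hermitian(H_traceless, 1j)

print(np.allclose(lhs, rhs), np.allclose(np.linalg.det(U), 1.0))
```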

We are now left with the task of parameterizing a traceless $N\times N$ hermitian matrix. Such 
a matrix has $(N^2 - N)$ complex off-diagonal elements constrained by $(N^2 - N)/2$ complex 
hermiticity constraints and $N$ real diagonal elements constrained by 1 condition on the trace. 
In other words, they have $(N^2 - 1)$ independent elements. Any set of $(N^2 - 1)$ linearly 
independent traceless hermitian $N\times N$ matrices will act as a basis for this space. Since they 
exhaust the space of $H$, they generate all of the elements of SU(N) through (C.3). The most 
natural way to choose these generators is$^{2}$ 

$$(H)^i{}_j = A^l{}_k\,(M^k{}_l)^i{}_j ;$$

$$(M^k{}_l)^i{}_j = \delta^i_l\delta^k_j - \frac{1}{N}\delta^k_l\delta^i_j , \tag{C.7}$$

since this choice does not introduce any new objects. All we are using to construct our 
generators are the invariants we have from the above discussion. Different choices of the 
parameters $A^l{}_k$ will lead to different elements of the group. 
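The properties of these 'natural' generators can be explored numerically. The sketch below assumes NumPy and one particular index layout, $(M^k{}_l)^i{}_j = \delta^i_l\delta^k_j - \delta^k_l\delta^i_j/N$, stored as an array `M[k, l, i, j]`; both the layout and the function name are assumptions made for illustration. It confirms that each matrix is traceless, that the set obeys one linear relation, and that contracting with arbitrary coefficients returns the traceless part of the coefficient matrix.

```python
import numpy as np

def natural_generators(n):
    """M[k, l, i, j] = delta^i_l delta^k_j - delta^k_l delta^i_j / n (assumed layout)."""
    d = np.eye(n)
    return np.einsum("il,kj->klij", d, d) - np.einsum("kl,ij->klij", d, d) / n

n = 3
M = natural_generators(n)

# Trace of each generator over its matrix indices (i, j).
traces = np.einsum("klii->kl", M)
# Sum of the 'diagonal' generators M^k_k, the one linear relation among them.
linear_relation = np.einsum("kkij->ij", M)
# Number of linearly independent generators among the n*n candidates.
rank = np.linalg.matrix_rank(M.reshape(n * n, n * n))

# H^i_j = A^l_k (M^k_l)^i_j for arbitrary coefficients A.
rng = np.random.default_rng(4)
A = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
H = np.einsum("lk,klij->ij", A, M)

print(rank, n * n - 1)
```

The rank deficit of exactly one is the overcounting of degrees of freedom that motivates a different choice of basis.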

Although there are in principle $N^2$ matrices $M$, they are not all linearly independent; they 
are related by $\delta^l_k(M^k{}_l)^i{}_j = 0$. This little inconvenience leads to several problems, not the least 
of which is that the $A$'s contain more degrees of freedom than required. Another difficulty 
is that our generators, as they stand, are not hermitian. Consider the hermitian conjugate to 
$(M^k{}_l)$ 

((Afftt)} ~ (Mf)l ■ (C.8) 

This matrix has the same entries (if we use the convention that 'upper' indices label rows 
while 'lower' indices label columns) as $(M^l{}_k)$. Hence the hermitian matrices $(M^k{}_l + M^l{}_k)$ 
and $(M^k{}_l - M^l{}_k)/i$ should in principle be used as generators rather than $M^k{}_l$ and $M^l{}_k$. These 
generators will have coefficients which are strictly real, while our original 'natural' choice will 
have complex coefficients constrained by the condition $(A^k{}_l)^* = A^l{}_k$. Although this is just the 
statement that complex conjugation interchanges the roles of the fundamental and conjugate 
representations, it is not easy to implement in practice. Since the sum $M^k{}_l + M^l{}_k$ makes sense 
only for specific values of $k$ and $l$,$^{4}$ these indices can no longer be thought of as truly in any 
representation of SU(N). This fact leads to many complications if one wishes to consider N 
large. However, in practice we seldom need to consider N > 3 so the hermitian choice is often 
made. 

2 Since $q^i \to (Uq)^i = U^i{}_jq^j$, our matrices must have one index in the fundamental and one in the conjugate 
representation. 

3 We use ∼ rather than = here because the indices i and j are in different spaces on either side of the 
equation. 

Let us consider generators which are all hermitian and which do not overcount the relevant 
degrees of freedom. Taking our new generators to be $t^a$, $a = 1$ to $(N^2 - 1)$, we write 

$$U = e^{i\theta^at^a} . \tag{C.9}$$

The $\theta^a$'s are in a new $(N^2 - 1)$-dimensional space. As with the Lorentz group, these objects 
transform in the adjoint representation of SU(N). Note that in the adjoint representation 
there are no 'upper' and 'lower' indices since the $\theta$'s are real. 

Since we are working within a group, products of transformations are themselves transfor- 
mations. In particular,^ 

= 1 + \ 2 [oj,9] +C(A 3 ) (CIO) 

is an element of the group and hence is expressible in terms of the generators. This implies 
for small A that 



anb\i_a >b 



U) 



[t a ,t b ]=ir] c t c (C.ll) 



for some n. Differentiating with respect to to and 9, we obtain 

[t c \t b ] =if abc t c , (C.12) 
where I have introduced the f abc, s as the relevant derivatives of r\. The f abc, s are called the 



structure constants of the group, and Equation ( C.12 ) is said to define the Lie algebra of the 

4 We cannot have $k$ or $l$ in both the fundamental and the conjugate representation! 
5 I have suppressed the adjoint indices via $\theta = \theta^a t^a$. 

group. The structure constants are obviously antisymmetric in their first two indices, but we 
can make them totally antisymmetric by choosing the $t^a$'s to be orthogonal in the sense that⁶ 

$$\mathrm{Tr}\, t^a t^b = T_F\,\delta^{ab} , \tag{C.13}$$

where $\delta^{ab}$ is the Kronecker delta in adjoint space and $T_F$ is some positive normalization. With 
this choice, 

$$f^{abc} = -\frac{i}{T_F}\,\mathrm{Tr}\,[t^a,t^b]\,t^c . \tag{C.14}$$

Total antisymmetry follows from this identity and the cyclicity of the trace operation. 
Since the Jacobi identity 

$$[t^a,[t^b,t^c]] + [t^b,[t^c,t^a]] + [t^c,[t^a,t^b]] = 0 \tag{C.15}$$

is trivially true, we have a consistency relation for the $f^{abc}$'s: 

$$f^{abe}f^{cde} + f^{bce}f^{ade} + f^{cae}f^{bde} = 0 . \tag{C.16}$$
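As a concrete check of (C.14) and (C.16), the following sketch evaluates the structure constants numerically for $SU(2)$. It assumes the conventional choice $t^a = \sigma^a/2$ with $T_F = 1/2$, which the text does not fix; the helper names (`matmul`, `structure_constant`, and so on) are illustrative only.

```python
# Numerical check of (C.14) and (C.16) for SU(2).
# Assumption: t^a = sigma^a / 2, so that Tr t^a t^b = (1/2) delta^{ab}.
T_F = 0.5
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
t = [[[entry / 2 for entry in row] for row in sigma] for sigma in PAULI]


def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def structure_constant(a, b, c):
    """f^{abc} = -(i/T_F) Tr([t^a, t^b] t^c), as in (C.14)."""
    value = (-1j / T_F) * trace(matmul(commutator(t[a], t[b]), t[c]))
    return value.real  # the imaginary part vanishes for hermitian generators


R = range(3)
f = [[[structure_constant(a, b, c) for c in R] for b in R] for a in R]

# Largest violation of antisymmetry under exchange of adjacent index pairs.
antisymmetry_error = max(
    max(abs(f[a][b][c] + f[b][a][c]), abs(f[a][b][c] + f[a][c][b]))
    for a in R for b in R for c in R
)

# Largest value of the left-hand side of (C.16), summed over e.
jacobi_error = max(
    abs(sum(f[a][b][e] * f[c][d][e] + f[b][c][e] * f[a][d][e]
            + f[c][a][e] * f[b][d][e] for e in R))
    for a in R for b in R for c in R for d in R
)

print(f[0][1][2], antisymmetry_error, jacobi_error)
```

With this normalization the structure constants reduce to the Levi-Civita symbol, so `f[0][1][2]` is $f^{123} = 1$ and both error measures vanish up to rounding.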



Let us calculate the transformation law in the adjoint representation. Since the $\theta^a$'s and 
the $\lambda^i_j$'s play the same role, under an $SU(N)$ transformation $\theta \to U\theta U^\dagger$. For infinitesimal 
transformations, we have 

$$\theta \to \theta + i[\omega,\theta] , \tag{C.17}$$

or 

$$\theta^a \to \theta^a - f^{abc}\omega^b\theta^c . \tag{C.18}$$
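The step from (C.17) to (C.18) uses only the Lie algebra (C.12) and the total antisymmetry of the structure constants. Written out as a short check, with $\theta = \theta^a t^a$ and $\omega = \omega^a t^a$:

```latex
\begin{align*}
i[\omega,\theta]
  &= i\,\omega^b\theta^c\,[t^b,t^c]
   = i\,\omega^b\theta^c\left(i f^{bca}\,t^a\right) \\
  &= -f^{bca}\,\omega^b\theta^c\,t^a
   = -f^{abc}\,\omega^b\theta^c\,t^a ,
\end{align*}
```

where $f^{bca} = f^{abc}$ because a cyclic permutation is an even permutation of the indices. Reading off the coefficient of $t^a$ gives (C.18).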

Since this is an $SU(N)$ transformation, it can be written $\theta^a \to U^{ab}\theta^b$ with $U$ given by 
$e^{i\omega^c T^c_A}$, exactly in the same way as before. The only difference is that now the generators are 
$(N^2 - 1) \times (N^2 - 1)$ dimensional matrices. This implies the identification 

$$(T^a_A)^{bc} = -i f^{abc} , \tag{C.19}$$



from which standpoint the Jacobi identity, (C.16), is simply a re-expression of (C.12) in the 
adjoint representation. 

6 The normalization is not important here. It need only be the same for all generators to induce fully 
antisymmetric structure constants. In some sense, choosing the same normalization for all of our generators 
restores the symmetry we had to break to obtain a basis which does not overcount the relevant degrees of 
freedom. Any other choice would have a favored direction in adjoint space. 

Since the Kronecker delta and the structure constants seem to be properties of the group 
itself rather than a specific transformation, we expect them to be invariant under 
transformations. We can check this by explicit calculation: 

$$\delta^{ab} \to \delta^{ab} + f^{acd}\theta^d\delta^{cb} + f^{bcd}\theta^d\delta^{ac} = \delta^{ab}$$

$$f^{abc} \to f^{abc} - \left(f^{bce}f^{ade} + f^{cae}f^{bde} + f^{abe}f^{cde}\right)\theta^d \tag{C.20}$$

$$= f^{abc} , \tag{C.21}$$

where the last equality follows from the Jacobi identity, (C.16). In the adjoint representation 
the generators are the structure constants, implying that the generators are invariant under 
the action of the group if we rotate all of their indices. This is obvious with the natural choice 
for the generators, (C.7), since there they are expressed solely in terms of the invariant $\delta^i_j$. 

There is one other invariant in the adjoint representation which will be useful to us. This 
object appears because the anticommutator of two generators is a hermitian 
$(N \times N)$ matrix and so can be represented as a linear combination of the generators themselves 
and the identity. We write 

$$\{t^a,t^b\} = a^{ab} + d^{abc}t^c . \tag{C.22}$$

Taking the trace of both sides, we find that $a^{ab} = (2T_F/N)\,\delta^{ab}$. The $d^{abc}$'s are, with our choice 
of normalization, fully symmetric since 

$$d^{abc} = \frac{1}{T_F}\,\mathrm{Tr}\,\{t^a,t^b\}\,t^c . \tag{C.23}$$

These structure constants transform according to 

$$d^{abc} \to d^{abc} - \left(f^{ade}d^{bce} + f^{bde}d^{cae} + f^{cde}d^{abe}\right)\theta^d . \tag{C.24}$$

An identity similar to Jacobi's, 

$$[t^a,\{t^b,t^c\}] + [t^c,\{t^a,t^b\}] + [t^b,\{t^c,t^a\}] = 0 , \tag{C.25}$$

can be employed to show that the quantity in parentheses is in fact zero. These definitions 
allow us to write 

$$t^a t^b = \frac{1}{2}\left([t^a,t^b] + \{t^a,t^b\}\right) = \frac{1}{2}\left(\frac{2T_F}{N}\,\delta^{ab} + d^{abc}t^c + i f^{abc}t^c\right) , \tag{C.26}$$

a very useful result. 

Since we have chosen the same normalization for all of our generators, there is no preferred 
direction in adjoint space. Any quantity which does not depend on an external adjoint quantity 
must be expressed in terms of invariants, in exactly the same way that Lorentz covariant 
quantities which do not depend on external vectors must be expressed in terms of the metric 
and the Levi-Civita tensor. This is a representation independent result since it has only to do 
with the way we have normalized our generators. Since all other representations can be built 
from the fundamental representation, normalizing the $t^a$ induces a normalization on $T^a_R$ for 
any representation $R$. This normalization is given by 

$$\mathrm{Tr}_R\, T^a_R T^b_R = T_R\,\delta^{ab} , \tag{C.27}$$

where we have used the fact that the left-hand-side is independent of any external adjoints 
and hence must be expressed in terms of the invariant $\delta$. 

The value $T_R$ depends only on the representation the trace is performed in. For example, 
let us calculate $T_A$:⁷ 

$$\mathrm{Tr}_A\, T^a_A T^b_A = f^{acd}f^{bcd} = T_A\,\delta^{ab} . \tag{C.28}$$



The sum on the $f$'s can be expressed in terms of the $t$'s using (C.14): 

$$f^{acd}f^{bcd} = -\frac{1}{T_F^2}\,\mathrm{Tr}\,[t^a,t^c]\,t^d\;\mathrm{Tr}\,[t^b,t^c]\,t^d . \tag{C.29}$$

$(t^a)^i_j (t^a)^k_l$ is an object with two fundamental indices and two conjugate indices which does not 
depend on an external vector, hence it can be expressed in terms of the available invariants, 
$\delta^i_j\delta^k_l$ and $\delta^i_l\delta^k_j$. Since both matrices are traceless, we write 

$$(t^a)^i_j (t^a)^k_l \propto \delta^i_l\delta^k_j - \frac{1}{N}\,\delta^i_j\delta^k_l . \tag{C.30}$$

Contracting with $\delta^j_k\delta^l_i$ gives $\mathrm{Tr}\, t^a t^a = T_F\,\delta^{ab}\delta^{ab} = (N^2 - 1)\,T_F$, so 

$$(t^a)^i_j (t^a)^k_l = T_F\left(\delta^i_l\delta^k_j - \frac{1}{N}\,\delta^i_j\delta^k_l\right) . \tag{C.31}$$

This useful relation is known as a Fierz⁸ identity. Using it we can expand the trace and do 
the sum, obtaining 

$$T_A = 2NT_F . \tag{C.32}$$

7 Confusion between the generators $T^a_R$ and their normalization $T_R$ should be rare. 
8 Pronounced almost like 'fierce'. 

The operator $(T^a_R T^a_R)_{\alpha\beta}$ has two indices in the representation $R$ and no adjoint indices. 
Hence it must be proportional to the identity in $R$, i.e.⁹ 

$$(T^a_R T^a_R)_{\alpha\beta} = C_R\,\delta_{\alpha\beta} . \tag{C.33}$$

It is obvious from the Fierz identity that $C_F = T_F(N^2 - 1)/N$ and $C_A = T_A$. As with the 
operators $P^2$ and $W^2$ in $SO(3,1)$, this operator is a Casimir for $SU(N)$. From here, one can 
calculate many group structures. I list a few useful relations in Appendix A.2 for reference. 
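The relations (C.31), (C.32) and (C.33) can be checked numerically for $N = 3$. The sketch below assumes the Gell-Mann basis $t^a = \lambda^a/2$ with $T_F = 1/2$, a common convention that the text does not mandate; all function and variable names are illustrative.

```python
import math

# Numerical check of the Fierz identity (C.31), the fundamental Casimir in
# (C.33), and T_A = 2 N T_F from (C.32), for SU(3).
# Assumption: Gell-Mann basis, t^a = lambda^a / 2, so T_F = 1/2.
N = 3
T_F = 0.5
S3 = 1 / math.sqrt(3)
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[S3, 0, 0], [0, S3, 0], [0, 0, -2 * S3]],
]
t = [[[x / 2 for x in row] for row in lam] for lam in GELL_MANN]
A = range(len(t))  # adjoint indices a = 1 .. N^2 - 1 (zero-based here)
I = range(N)       # fundamental indices


def delta(i, j):
    return 1.0 if i == j else 0.0


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in I) for j in I] for i in I]


def structure_constant(a, b, c):
    """f^{abc} = -(i/T_F) Tr([t^a, t^b] t^c), as in (C.14)."""
    ab, ba = matmul(t[a], t[b]), matmul(t[b], t[a])
    comm = [[ab[i][j] - ba[i][j] for j in I] for i in I]
    prod = matmul(comm, t[c])
    return ((-1j / T_F) * sum(prod[i][i] for i in I)).real


# Fierz identity (C.31); (t^a)^i_j is stored as t[a][i][j].
fierz_error = max(
    abs(sum(t[a][i][j] * t[a][k][l] for a in A)
        - T_F * (delta(i, l) * delta(k, j) - delta(i, j) * delta(k, l) / N))
    for i in I for j in I for k in I for l in I
)

# Fundamental Casimir: sum_a t^a t^a = C_F * identity, C_F = T_F (N^2 - 1) / N.
C_F = T_F * (N * N - 1) / N
casimir = [[sum(t[a][i][k] * t[a][k][j] for a in A for k in I) for j in I]
           for i in I]
casimir_error = max(abs(casimir[i][j] - C_F * delta(i, j))
                    for i in I for j in I)

# Adjoint normalization: f^{acd} f^{bcd} = T_A delta^{ab}, T_A = 2 N T_F.
f = [[[structure_constant(a, b, c) for c in A] for b in A] for a in A]
T_A = 2 * N * T_F
adjoint_error = max(
    abs(sum(f[a][c][d] * f[b][c][d] for c in A for d in A) - T_A * delta(a, b))
    for a in A for b in A
)

print(C_F, T_A, fierz_error, casimir_error, adjoint_error)
```

For these conventions the expected values are $C_F = 4/3$ and $T_A = C_A = 3$, and the three error measures should be at the level of floating-point rounding.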

Explicit forms for the structure constants can be obtained for general $SU(N)$ if one uses 
the 'natural' choice for the generators, $M^i_j$. It is straightforward to check that 

(Mj)*(i\C)f = SfaS? - ^5 k m S? - ±6*5Z5i + ^ ■ ( C - 34 ) 

The normalization of these generators is obtained via 

(Mi) k p (M^)f5 l k = (Mj)« . (C.35) 

The interpretation of this relation is obvious once one considers the symmetry of the situation. 
Above, we normalized our generators according to $\mathrm{Tr}\, t^a t^b = T_F\,\delta^{ab}$. Here, it is not yet clear 
what form $\delta^{ab}$ will take. In the most fundamental sense, $\delta^{ab}$ is simply an invariant tensor with 
two indices in the adjoint representation. By inspection, one can see that in this representation 
for the generators adjoint indices consist of traceless pairs of fundamental and anti-fundamental 
indices. The only such invariant object at our disposal is $M$ itself. Indeed, these two 
conditions were the reason this form was chosen for $M$ in the first place! Hence we are led to 
the identification 

5 ab -> (M}) n m . (C.36) 
Proper normalization is obtained by the requirement that 

iV 2 - 1 = (MX (Ml )™ , (C.37) 



which can be checked explicitly. With this in mind, (C.35) implies that our generators are 
normalized according to $T_F = 1$. 

9 The fact that I have written both indices downstairs here has no meaning. What is required is the same 
index structure as the generators have in the representation R. 

From here, we are free to calculate any structures we wish. For example, 



-7- ) ,V 



C F 5 i j = (MFy p (MZT j (M l k )™=(N 



N 



(C.38) 



Furthermore, the explicit form of the product (C.34) allows a direct calculation of the antisymmetric 



[Mf, Mi 



7 f ijk M n ■ 

1 J Iran 1V1 k 1 



ijk 
Iran 



and symmetric 



(C.39) 



Iran 



N 

2 



(C.40) 



-J^Sl&ra&n ■ 



structure constants. Note that these constants have the stated symmetry properties only under 
the exchange of pairs of indices. Note also that was chosen to have this symmetry. Two 



of its terms cannot contribute to (C.40). Armed with these expressions for general $N$, we can 
directly verify any of the expressions in Appendix A.2. Proper normalization to any value of 
$T_F$ is trivially performed by rescaling the generators.¹⁰ 

As a last topic for this section, let us consider the problem of finding irreducible 
representations of $SU(N)$. Suppose we wish to consider $(N \times N)$, 

$$T^{ij} . \tag{C.41}$$

Since there are no invariants with two fundamental indices, the decomposition of this tensor 
is trivial. We simply symmetrize and antisymmetrize its indices: 

$$T^{ij} \to T^{(ij)} + T^{[ij]} . \tag{C.42}$$

The symmetric representation has $N(N-1)/2 + N = N(N+1)/2$ components while the 
antisymmetric has $N(N-1)/2$. In this way, we arrive at 

$$N \times N = \frac{N(N+1)}{2} + \frac{N(N-1)}{2} . \tag{C.43}$$



10 As far as normalization is concerned, $f$ and $d$ count as generators. 

Note that we don't need to subtract a trace here; this concept has no meaning in our space. 
Larger numbers of indices in one representation are handled in an analogous way. One simply 
adds indices successively to the decomposition in (C.42), symmetrizing and antisymmetrizing 
as one goes. For example, $(N \times N \times N)$ is decomposed as 

$$T^{ijk} \to T^{(ij)k} + T^{[ij]k}$$

$$\to T^{(ijk)} + T^{(i[j)k]} + T^{[i(j]k)} + T^{[ijk]} . \tag{C.44}$$

For $SU(3)$, this decomposition corresponds to 

$$3 \times 3 \times 3 = 10 + 8 + 8 + 1 . \tag{C.45}$$
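The component counting behind (C.43) and (C.45) can be sketched for general $N$. The two-index formulas are those quoted in the text; the three-index dimensions used below (in particular $N(N^2-1)/3$ for each mixed-symmetry piece) are standard results that are assumed here rather than derived in the text, and the function names are illustrative.

```python
# Dimension counting for tensor products of fundamental indices of SU(N).
# Assumption: the three-index dimensions are the standard values for the
# totally symmetric, mixed-symmetry and totally antisymmetric pieces.

def two_index_dims(n):
    """(symmetric, antisymmetric) dimensions in N x N, as in (C.43)."""
    return n * (n + 1) // 2, n * (n - 1) // 2


def three_index_dims(n):
    """(symmetric, mixed, mixed, antisymmetric) dimensions in N x N x N."""
    symmetric = n * (n + 1) * (n + 2) // 6
    mixed = n * (n * n - 1) // 3
    antisymmetric = n * (n - 1) * (n - 2) // 6
    return symmetric, mixed, mixed, antisymmetric


def conjugate_product_dims(n):
    """(adjoint, singlet) dimensions in the product of N with its conjugate."""
    return n * n - 1, 1


for n in (2, 3, 4):
    print(n, two_index_dims(n), three_index_dims(n), conjugate_product_dims(n))
```

For $N = 3$ this gives $6 + 3$ for two indices and $10 + 8 + 8 + 1$ for three, and in every case the pieces add up to $N^2$ or $N^3$ as they must.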

This result is relevant to the study of baryons both in $SU(3)_f$, where the 8 corresponds to the 
octet of spin-1/2 and the 10 corresponds to the decuplet of spin-3/2, and the color $SU(3)$ 
gauge group, where the 1 is the colorless singlet required for a physical state. In $SU(3)$, the 
totally antisymmetric combination $T^{[ijk]}$ is obviously invariant since it can be written in terms 
of $\epsilon^{ijk}$. This correspondence allows us to clearly see the fact that 

$$T^{[i_1 \cdots i_{N-1}]} = \epsilon^{i_1 \cdots i_N}\,T_{i_N} \tag{C.46}$$

transforms as an $\bar{N}$ rather than an $N$. 

The decomposition of $N \times \bar{N}$ is quite different. Here, we have the invariant $\delta^i_j$, which leads 
to a trace subtraction. In this case, it is the concept of symmetrization and antisymmetrization 
which does not make sense. This leads to the decomposition 

$$T^i_j = \left(T^i_j - \frac{1}{N}\,\delta^i_j\,T^k_k\right) + \frac{1}{N}\,\delta^i_j\,T^k_k . \tag{C.47}$$

The trace is invariant and forms the representation 1 of $SU(N)$, while the remainder transforms 
in the adjoint and forms the $(N^2 - 1)$. Hence 

$$N \times \bar{N} = (N^2 - 1) + 1 . \tag{C.48}$$

This decomposition is relevant to meson physics, where the $SU(3)_f$ group tells us we will have 
an octet (the 8) and a singlet (the 1) and the $SU(3)$ color group tells us that the combination 
of a quark and an antiquark can produce a physical color singlet state. 

C.2 Local SU(N) 

As detailed in Chapter 1, QCD is a quantum field theory based on a local $SU(3)$ symmetry. 
One considers several flavors of quark triplets in the 3 and antiquark triplets in the $\bar{3}$, and 
requires the lagrangian to be invariant under local $SU(3)$ transformations. This leads to the 
introduction of the covariant derivative, $\mathcal{D}$. If we just compared the field at two different points 
(using the ordinary derivative), differences in convention between the two points would lead 
to physically unimportant contributions. The covariant derivative takes all of these changes in 
convention into account and asks for the difference between physical configurations. However, 
it can only do this for two infinitesimally close points. 

Suppose I want to compare field values at two points which are a finite distance apart. 
Since these objects are defined with different conventions, we must first change the convention 
of one of them to match the other convention. This operation is called parallel transport. The 
changes in convention occur locally, so we will need to know the convention at every point on 
a path between the two points to perform this transformation. The covariant derivative tells 
us that local changes in convention are accounted for by the gauge field A. Hence we expect 
parallel transport to involve some sort of nonlocal combination of gauge fields. The infinite 
number of paths connecting two points in 4-dimensional spacetime make it necessary for us to 
choose a curve, $C$, to define a unique parallel transport operator. Defining $C$ by points $z(\tau)$ 
such that $z(0) = y$ and $z(1) = x$, it can be shown that the operator 

$$\mathcal{G}_C(x,y) = P\,e^{-ig\int_0^1 A^\mu(z(\tau))\,\frac{dz_\mu(\tau)}{d\tau}\,d\tau} \tag{C.49}$$

satisfies the requirements for parallel transport from $y$ to $x$ along $C$. The $P$-symbol orders the 
$A$ fields such that those corresponding to larger $\tau$ come first, i.e.¹¹ 

$$P\left\{e^{\int_0^1 M(\tau)\,d\tau}\right\} = 1 + \int_0^1 d\tau\,M(\tau) + \int_0^1 d\tau\,M(\tau)\int_0^\tau d\tau'\,M(\tau')$$

$$+ \int_0^1 d\tau\,M(\tau)\int_0^\tau d\tau'\,M(\tau')\int_0^{\tau'} d\tau''\,M(\tau'') + \cdots . \tag{C.50}$$

It may be checked explicitly for infinitesimal gauge transformations that 

$$\mathcal{G}_C(x,y) \to U(x)\,\mathcal{G}_C(x,y)\,U^\dagger(y) \tag{C.51}$$



11 The $1/n!$ in the expansion of an exponential appears naturally here if the path-ordering is trivial since 
$\int_0^1 d\tau \int_0^\tau d\tau'\, f(\tau) f(\tau') = \frac{1}{2}\left(\int_0^1 d\tau\, f(\tau)\right)^2$ if $f(\tau)$ commutes with itself for different values of $\tau$. 

for any choice of the path $C$. Since all gauge transformations we consider can be obtained from 
repeated application of infinitesimal ones, this is all we need.¹² Qualitatively, we can think of 
(C.49) as iteratively taking changes in convention into account. The first term corresponds to 
no change in convention, the second takes a change in convention into account at one point 
along $C$ and sums over all possible places the convention could be changed, the third changes 
conventions twice and sums over the positions of those changes, etc. The path ordering assures 
us that changes in convention are made in the correct order as we traverse $C$. 
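The role of the path ordering can be illustrated with a discretized version of (C.50): the ordered exponential is approximated by a product of short-step exponentials, with factors at larger $\tau$ placed to the left. The sketch below uses $2 \times 2$ matrices; the two test functions `commuting` and `switching`, the step count and the Taylor-series matrix exponential are all illustrative assumptions, not constructions taken from the text.

```python
# Discretized path-ordered exponential of a matrix-valued function M(tau),
# tau in [0, 1], compared with the unordered exponential of its integral.

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(m, c):
    return [[c * x for x in row] for row in m]


def mat_exp(m, terms=40):
    """Matrix exponential from its Taylor series (adequate for small norms)."""
    result, term = identity(len(m)), identity(len(m))
    for k in range(1, terms):
        term = scale(matmul(term, m), 1.0 / k)
        result = add(result, term)
    return result


def path_ordered_exp(M, steps):
    """Product of exp(M(tau_k) dtau) with larger tau standing to the left."""
    dtau = 1.0 / steps
    result = identity(2)
    for k in range(steps):
        tau = (k + 0.5) * dtau
        result = matmul(mat_exp(scale(M(tau), dtau)), result)
    return result


def naive_exp(M, steps):
    """exp of the (midpoint-rule) integral of M, ignoring the ordering."""
    dtau = 1.0 / steps
    total = [[0.0, 0.0], [0.0, 0.0]]
    for k in range(steps):
        total = add(total, scale(M((k + 0.5) * dtau), dtau))
    return mat_exp(total)


def max_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Z = [[1, 0], [0, -1]]


def commuting(tau):
    # M(tau) commutes with itself at different tau: ordering is trivial.
    return scale(SIGMA_Z, -1j * tau)


def switching(tau):
    # M(tau) changes direction halfway along the path: ordering matters.
    return scale(SIGMA_X if tau < 0.5 else SIGMA_Z, -1j)


STEPS = 200
diff_commuting = max_difference(path_ordered_exp(commuting, STEPS),
                                naive_exp(commuting, STEPS))
diff_switching = max_difference(path_ordered_exp(switching, STEPS),
                                naive_exp(switching, STEPS))
print(diff_commuting, diff_switching)
```

In the commuting case the two results agree to rounding error, which is the situation described in the footnote on the $1/n!$ factors; in the non-commuting case they differ at order one, which is why the $P$-symbol cannot be dropped for non-abelian gauge fields.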

The comparison of field values at two different points in spacetime may now be written 

$$(\Delta_C\psi)(x,y) = \psi(x) - \mathcal{G}_C(x,y)\,\psi(y) . \tag{C.52}$$

From the transformation properties of $\mathcal{G}$, it is obvious that $(\Delta\psi)(x,y)$ transforms like $\psi(x)$. 
Of course, we may define this difference at any point in spacetime by appropriate use of $\mathcal{G}$: 

$$(\Delta_{C,C'}\psi)(x,y)\big|_z = \mathcal{G}_{C'}(z,x)\,\psi(x) - \mathcal{G}_{C'+C}(z,y)\,\psi(y) , \tag{C.53}$$

where $C'$ is some path connecting $x$ and $z$. The parallel transport operator $\mathcal{G}$ is commonly 
called a gauge link and is very useful in defining nonlocal distributions, as in Section 2.3. 

Different paths used to define $\mathcal{G}_C$ are not equivalent. The difference between them can be 
characterized by the Wilson loop, 

$$U_\ell(x) = \mathcal{G}_{C'}(x,y)\,\mathcal{G}_C(y,x) . \tag{C.54}$$

The loop $\ell$ forms a closed curve containing the points $x$ and $y$ and defined by the paths $C$ 
and $C'$. Expanding this object about $g = 0$, we see that its terms arrange themselves into 
path-ordered loop integrals: 

Ui(x) ~ 1 — ig <z> A(z) ■ dz 



-ig 2 f t c J A a (z{T))-(dz/dr)dT J A b {z{T'))-(dz/dT')dT' 



r 2 

2 



-g 2 A(z{t)) -{dz/dT)dT A{z{T l ))-{dz/dT l )dT l +0{g A ) . (C.55) 
Jo Jo 



1-2 , 



As stated before, other transformations can be accounted for by rotations in the other sectors of the SM. 
Topologically nontrivial gauge configurations are associated with behavior at infinity and do not concern us 
here. 

Here, $z^\mu(\tau)$ traces out $C$ for $\tau \in [0,1)$ and $C'$ for $\tau \in (1,2]$. 

In abelian theories, one can use Stokes' theorem to convert these integrals into surface 
integrals over the area $\ell$ bounds. The same spirit is present in non-abelian theories, although 
the difficulties associated with noncommutativity make it less transparent. To see this, we 
consider an infinitesimal loop. Taking $y = x + dx^\mu + dx^\nu$, where $\mu$ and $\nu$ are for the moment 
fixed directions, we can evaluate the difference in choice between a path that goes first along 
$dx^\mu$ then along $dx^\nu$ and a path that does the opposite by approximating the integrals: 

$$\oint A(z)\cdot dz \sim A^\mu(x)\,dx_\mu + A^\nu(x + dx^\mu)\,dx_\nu - A^\nu(x)\,dx_\nu - A^\mu(x + dx^\nu)\,dx_\mu \tag{C.56}$$

$$= \left(\partial^\mu A^\nu - \partial^\nu A^\mu\right) dx_\mu\,dx_\nu , \tag{C.57}$$

where I have suspended the Einstein summation convention for the moment. Similarly, the 
non-abelian commutator term in (C.55) is approximated as 

$$-ig^2 f^{abc}t^c \int_1^2 A^a(z(\tau))\cdot\frac{dz}{d\tau}\,d\tau \int_0^1 A^b(z(\tau'))\cdot\frac{dz}{d\tau'}\,d\tau' \sim ig^2 f^{abc}t^c A^\nu_a(x)A^\mu_b(x)\,dx_\nu\,dx_\mu . \tag{C.58}$$

Combining these two contributions, the integral is expressed 

$$-ig\,\mathcal{F}^{\mu\nu}(x)\,dx_\mu\,dx_\nu . \tag{C.59}$$

The antisymmetric combination of the differentials gives an infinitesimal area element - the 
area of the bounded region. This statement is valid for any choices of $\mu$ and $\nu$, allowing 
us to apply it to the discretization of any spacetime region bounded by curves $C$ and $C'$. 
Furthermore, this analysis can be extended to higher orders in the coupling. 

The fact that $\mathcal{F}$ itself is not gauge invariant makes it difficult to compare infinitesimal 
contributions from different regions. However, the gauge-invariant trace of $U_\ell$ admits a full 
generalization of Stokes' theorem: 

$$\mathrm{Tr}\, U_\ell = \mathrm{Tr}\, e^{-ig\int_S \mathcal{F}^{\mu\nu}\,d\sigma_{\mu\nu}} , \tag{C.60}$$

where $S$ is any surface bounded by $\ell$ and $d\sigma_{\mu\nu}$ is the four-dimensional area element. This 
object can be used as a gauge-invariant measure of the strength of the gauge field in a region. 
It is also one of the fundamental building blocks of the discretized theory of lattice QCD. 

C.3 The Limit of Large $N_c$ 



Some very useful results have come from considering a local gauge theory based on the group 
$SU(N_c)$ in the limit of large $N_c$. In this section, I will briefly overview some of the main results 
and simplifying features of this limit. Some useful references on this subject are [92]. 

The lagrangian of large-$N_c$ QCD is identical to that of normal QCD, except that here 
there are $N_c$ quark colors and $(N_c^2 - 1)$ gluons. As we have seen above, adjoint objects 
such as gluons can be represented as the traceless combination of a fundamental and an 
anti-fundamental index. Since we intend to take $N_c$ to infinity, the fact that the trace degree 
of freedom should be subtracted out is of no great importance. Hence, with corrections of 
order $1/N_c^2$, we can consider a gluon as a quark-antiquark combination.¹³ The replacement of 
all gluon propagators with double quark propagators, with fermion number flow in opposite 
directions, makes it easy to count powers of $N_c$. Each unconstrained loop gives a factor of $N_c$ 
when its colors are summed over. For example, Figure C.1a comes with the factor $N_c^2$ because 
there are two unconstrained 'fermion' loops. Unless we can regulate this behavior in some way, 
it quickly becomes obvious that each higher order of perturbation theory will contain diagrams 
with more powers of $N_c$. This is certainly not the hallmark of a theory with a well-defined 
limit as $N_c \to \infty$. 

By inspection, one can see that factors of $N_c$ are always accompanied by factors of $g_{N_c}^2$, 
where $g_{N_c}$ is the coupling constant of our theory. Since this is our theory, we can force $g$ to 
scale with $N_c$ in any way we want. Comparison with the real world requires only that $g_3$ has 
the value measured in experiment. With this in mind, we see that our problem can be solved 
by taking 

$$g_{N_c} = g_3\sqrt{\frac{3}{N_c}} . \tag{C.61}$$

This definition causes all of the leading powers of $N_c$ at each order in perturbation theory to 
contribute at the same power of $N_c$ to the final amplitude under consideration. 
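The effect of letting the coupling fall like $1/\sqrt{N_c}$ can be made concrete with a small power-counting sketch. It assumes the normalization $g_{N_c} = g_3\sqrt{3/N_c}$ (fixed by requiring agreement with $g_3$ at $N_c = 3$) and assigns each diagram a weight $g_{N_c}^{n} N_c^{L}$ for $n$ coupling factors and $L$ unconstrained color loops. The specific counts used for the two example diagrams (four couplings with two color loops, six couplings with one color loop) are assumptions chosen to match the two-loop planar and three-loop nonplanar graphs described for Figure C.1.

```python
import math

# Large-N_c power counting for a diagram with a given number of coupling
# factors and unconstrained color loops.
# Assumption: g_{N_c} = g_3 * sqrt(3 / N_c), so that g_{N_c}^2 N_c is fixed.

def coupling(n_c, g3=1.0):
    return g3 * math.sqrt(3.0 / n_c)


def diagram_weight(n_couplings, n_color_loops, n_c, g3=1.0):
    """Color-and-coupling weight g^n * N_c^L of a diagram."""
    return coupling(n_c, g3) ** n_couplings * n_c ** n_color_loops


COLORS = (3, 30, 300)
planar = [diagram_weight(4, 2, n_c) for n_c in COLORS]      # like Fig. C.1a
nonplanar = [diagram_weight(6, 1, n_c) for n_c in COLORS]   # like Fig. C.1b

for n_c, p, q in zip(COLORS, planar, nonplanar):
    print(n_c, p, q)
```

The planar weight stays fixed as $N_c$ grows, while the nonplanar weight falls like $1/N_c^2$, in line with the counting rules quoted for the two diagrams.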

Now that we have a well-defined limit, it behooves us to find out which diagrams contribute 
in this limit and which do not. Since each power of the coupling contributes a $1/\sqrt{N_c}$ to 
the amplitude, leading diagrams have the property that each coupling generates 'half' of an 

13 This is to be understood as taking place exclusively in the $SU(N_c)$ sector of the theory. 

Figure C.1: An illustration of the large-$N_c$ counting rules. (a) is enhanced by a power of $N_c^2$ 
because of the unconstrained color loops within the bubble. Since it is a two-loop graph, it 
will contribute to the leading large-$N_c$ behavior. On the other hand, (b) contains only one 
unconstrained color loop. As a three-loop diagram, its contribution comes only at $\mathcal{O}(1/N_c^2)$. 
Notice that (a) is a planar diagram, while (b) must contain a self-intersection when forced into 
a plane. As it stands, the counting rules for more complicated correlation functions, like that 
represented in (c), are ambiguous. At first, we may think that it is planar. However, drawing 
all external lines directly to the open dot (representing infinity) reveals its true nature. 

unconstrained color loop. Real quarks have only one color line, so quark loops cannot give 
rise to factors of $N_c$ even though they are associated with factors of $g_{N_c}^2$. This implies that all 
diagrams that involve quark loops are suppressed by $1/N_c$. This is easy to understand because 
there are simply more gluons around than quarks. Gluons¹⁴ will not always lead to the maximal 
power of $N_c$ either. By inspection, it can easily be seen that diagrams like that in Figure C.1b 
in which the color loops are entangled are subleading. This can be extended to a rule - all 
diagrams which possess self-intersecting color lines when embedded in a plane are subleading 
in the large-$N_c$ limit. Since these diagrams cannot be topologically embedded in a plane, they 
are referred to as 'nonplanar'. This rule works for loops, but must be amended to include 
correlation functions with several external particles. Here, there exist diagrams which can be 
drawn such that they appear either planar or nonplanar depending on preference. However, 
when all external lines are drawn directly to the same point at infinity, only the planar diagrams 
will contribute to the leading large-$N_c$ behavior. This rule is illustrated in Figure C.1c. 

Beyond our rule for leading diagrammatic contributions, not many results of large-$N_c$ 
physics affect the topics presented here. However, in hadronic physics this limit represents an 
extremely useful simplification. Among other things, it constrains meson-baryon couplings via 
unitarity relations, allows one to justify certain hadronic models, like the Skyrme model, and 
makes it easier to relate the macroscopic world of hadrons to the microscopic one of QCD. A 
new symmetry between the light quark flavor and spin becomes manifest in this limit. This 
'spin-flavor' symmetry arises because baryons composed of $N_c \to \infty$ quarks do not care if you 
flip the spin of one or two. This leads to a symmetry between the low-lying baryonic spin 
excitations [93]. In particular, it relates the $\Delta$ resonance to the nucleon. Corrections to this 
result can also be systematically calculated, a situation hard to come by in hadronic physics. 
Furthermore, an even larger symmetry group encompasses the baryons if we simultaneously 
take some of the quark masses to infinity [94]. 



14 In covariant gauges, one must also include ghosts. Since they appear as adjoints along with the gluons, 
their contributions can also give leading effects. 

Appendix D 

Quantum Mechanics 

The fundamental postulate of quantum mechanics, 

$$[x,p] = i\hbar , \tag{D.1}$$

is interpreted in quantum field theory as a statement about the commutator of a field operator 
with its conjugate momentum. The process known in the sixties as 'second quantization' 
involves promoting the wavefunctions of quantum mechanics to operators and imposing the 
'canonical commutator' (D.1) on these wavefunctions and their conjugate momenta. The 
purpose of this appendix is to quickly review this process and obtain explicit expressions for 
the generators of the Poincare group in a quantum field theory. 



The analysis of Section D.2 closely follows that in S. Weinberg 

D.1 Canonical Quantization 

In ordinary quantum mechanics, the Schrödinger equation, 

$$i\hbar\frac{\partial}{\partial t}\phi(\vec{x},t) = -\frac{\hbar^2}{2m}\nabla^2\phi(\vec{x},t) + V(\vec{x})\,\phi(\vec{x},t) , \tag{D.2}$$

is just the statement that 

$$E = \frac{\vec{p}^{\,2}}{2m} + V(\vec{x}) , \tag{D.3}$$

along with the added understanding that the quantities $E$ and $\vec{p}$ are operators on the function 
space that satisfy the commutation relations 

$$[x^\mu,p^\nu] = -i\hbar\,g^{\mu\nu} . \tag{D.4}$$

Equation (D.2) follows from the solution 

$$p^\mu = i\hbar\,\partial^\mu \tag{D.5}$$

of (D.4). 
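The statement that a derivative operator solves the commutation relation can be checked exactly on polynomials. The sketch below works in one spatial dimension and assumes a metric of signature $(+,-,-,-)$, so that the spatial component of $p^\mu = i\hbar\,\partial^\mu$ acts as $-i\hbar\, d/dx$; polynomials are stored as coefficient lists, and the function names are illustrative.

```python
# Exact check of [x, p] f = i hbar f on polynomials f(x) = sum_k c_k x^k.
# Assumption: one spatial dimension with p = -i hbar d/dx, which is the
# spatial component of p^mu = i hbar d^mu for a (+,-,-,-) metric.
HBAR = 1.0  # any positive value works; 1.0 is an arbitrary choice of units


def apply_x(coeffs):
    """Multiply the polynomial by x (shift every coefficient up one power)."""
    return [0] + list(coeffs)


def apply_p(coeffs):
    """Apply p = -i hbar d/dx to the polynomial."""
    return [-1j * HBAR * k * coeffs[k] for k in range(1, len(coeffs))]


def commutator_xp(coeffs):
    """Coefficients of (x p - p x) acting on the polynomial."""
    xp = apply_x(apply_p(coeffs))
    px = apply_p(apply_x(coeffs))
    return [a - b for a, b in zip(xp, px)]


f = [1.0, 2.0, 3.0]                      # f(x) = 1 + 2 x + 3 x^2
result = commutator_xp(f)
expected = [1j * HBAR * c for c in f]    # i hbar f(x)
print(result, expected)
```

Since both orderings return coefficient lists of the same length, the comparison is term by term, and the result reproduces $i\hbar f$ for any polynomial input.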



Unfortunately, a direct application of (D.5) to the relativistic version of (D.3), 

$$p^2 = m^2 , \tag{D.6}$$

leads to a Hamiltonian whose spectrum is not bounded from below. Hence our theory does not 
possess any stable configurations. The solution to this dilemma has to do with interpretation. 
In non-relativistic quantum mechanics, the dynamical variables are $x$ and $t$. One requires $\phi$ to 
be an eigenfunction of the energy operator, which depends on the position $x$. In quantum field 
theory, the dynamical variables are the fields $\phi$ themselves. Since $t$ is no longer a dynamical 
variable, we no longer have any need to bound its Fourier conjugate from below. This is 
the fundamental distinction between non-relativistic quantum mechanics and quantum field 
theory. 

The dynamical variables of any quantum system satisfy canonical commutation relations 
with their conjugate momenta. In order to define this statement, we must introduce a 
lagrangian for our system. A lagrangian is dictated by the equations of motion it generates, so 
the first question we must ask ourselves is what kind of equation of motion we would like our 
fields to satisfy. At some point, we intend to consider freely propagating massive particles, so 
a logical choice for the equation of motion is (D.6): 

$$\left(-\partial^2 - m^2\right)\phi = 0 . \tag{D.7}$$

This equation can be seen to follow from the classical lagrangian density¹ 

$$\mathcal{L} = \frac{1}{2}\,\partial_\mu\phi\,\partial^\mu\phi - \frac{1}{2}\,m^2\phi^2 , \tag{D.8}$$

from which we obtain the conjugate momentum 

$$\pi \equiv \frac{\delta\mathcal{L}}{\delta\,\partial_0\phi} = \partial^0\phi . \tag{D.9}$$



1 The method of going from lagrangian densities to equations of motion is discussed in the next section. 

The canonical commutation relation² 

$$[\phi(x),\pi(y)]_{x^0=y^0} = i\,\delta(\vec{x}-\vec{y}) \tag{D.10}$$

can be solved by expanding $\phi(x)$ in a complete basis of solutions to (D.7) with operator 
coefficients:³ 

$$\phi(x) = \int\frac{d^4p}{(2\pi)^4}\left[a_{\vec{p}}\,e^{-ip\cdot x} + a^\dagger_{\vec{p}}\,e^{ip\cdot x}\right]\sqrt{2p^0}\,\theta(p^0)\,(2\pi)\,\delta(p^2-m^2) . \tag{D.11}$$

Consistency with (D.10) requires 

$$\left[a_{\vec{p}},a^\dagger_{\vec{p}'}\right] = (2\pi)^3\,\delta(\vec{p}-\vec{p}') . \tag{D.12}$$


Several comments are in order here. First, the canonical commutation relation (D.10) must 
be imposed only at equal times; any other choice would violate causality.⁴ The presence of the 
$\delta$-function is understood immediately once we realize that we are quantizing a continuously 
infinite number of field densities $\phi(x)$. Our commutation relations must be such that the 
integral over a small region about $x$ gives one. Second, the form of equation (D.11) is dictated 
by the reality of $\phi(x)$. Complex fields will in general have different operators multiplying their 
positive- and negative-frequency solutions. Note that I have included the negative frequency 
solutions with the positive ones by changing the sign in the exponential. 
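
The field $\phi$, along with 

$$\pi(x) = -i\int\frac{d^4p}{(2\pi)^4}\left[a_{\vec{p}}\,e^{-ip\cdot x} - a^\dagger_{\vec{p}}\,e^{ip\cdot x}\right]p^0\sqrt{2p^0}\,\theta(p^0)\,(2\pi)\,\delta(p^2-m^2) , \tag{D.13}$$

can be used to form the true Hamiltonian of the system, 

$$H = \int d^3x\left[\pi(x)\,\partial^0\phi(x) - \mathcal{L}\right] = \int\frac{d^3p}{(2\pi)^3}\,\sqrt{\vec{p}^{\,2}+m^2}\left[a^\dagger_{\vec{p}}\,a_{\vec{p}} + \frac{1}{2}(2\pi)^3\delta^{(3)}(0)\right] . \tag{D.14}$$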



2 $\hbar$ was reinstated at the beginning of the Appendix for nostalgic purposes; from now on, it will be given 
the value 1. 

3 The hermiticity of $\phi$ relates the two operator coefficients, as shown. The factor of $\sqrt{2p^0}$ is for later 
convenience. It belongs to the operators $a_{\vec{p}}$ and $a^\dagger_{\vec{p}}$, as the combination $\sqrt{2p^0}\,a_{\vec{p}}$ is invariant. 

4 Actually, this statement is not strictly true. Lorentz invariance implies that we can impose this relation for 
equal coordinate along any timelike vector we so choose. In the limit of large boosts, we can even quantize on 
the light cone. However, once we have chosen a direction and imposed an equal 'time' commutation relation, 
commutation relations at different times are fixed and can no longer be imposed. 






The second term in this expression represents the sum of the zero-point energy of a continuously 
infinite number of quantum harmonic oscillators. This contribution to the energy is completely 
independent of the state we consider, and as such cannot contribute to any physical energy 
differences. For this reason, we will ignore it completely. The rest of the Hamiltonian involves 
the energy of a particle of momentum $\vec{p}$, 

$$E_{\vec{p}} = \sqrt{\vec{p}^{\,2}+m^2} , \tag{D.15}$$

and satisfies 

$$[H,a_{\vec{p}}] = -E_{\vec{p}}\,a_{\vec{p}} \tag{D.16}$$

$$[H,a^\dagger_{\vec{p}}] = E_{\vec{p}}\,a^\dagger_{\vec{p}} . \tag{D.17}$$

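Assuming a ground state $|0\rangle$ that satisfies 

$$a_{\vec{p}}\,|0\rangle = 0 , \tag{D.18}$$

we can construct a whole tower of states 

$$|n_1p_1,n_2p_2,\cdots,n_Np_N\rangle = \prod_{j=1}^N \left(2E_{\vec{p}_j}\right)^{n_j/2}\frac{\left(a^\dagger_{\vec{p}_j}\right)^{n_j}}{\sqrt{n_j!}}\,|0\rangle \tag{D.19}$$

which have the positive energy 

$$H\,|n_1p_1,n_2p_2,\cdots,n_Np_N\rangle = \sum_{j=1}^N n_jE_{\vec{p}_j}\,|n_1p_1,n_2p_2,\cdots,n_Np_N\rangle . \tag{D.20}$$

Our states are constructed to have the covariant normalization 

$$\langle p|p'\rangle = 2E_{\vec{p}}\,(2\pi)^3\,\delta(\vec{p}-\vec{p}') \tag{D.21}$$

for each particle.⁵ 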
These states have exactly the properties we expect of a state of $n_j$ free particles of mass 
$m$ and momentum $p_j$. Apparently, we are encouraged to interpret the operators $a^\dagger_{\vec{p}}$ and $a_{\vec{p}}$ 
as creating and destroying a particle, or field quantum, of momentum $\vec{p}$. Quite unexpectedly, 
we have come up with a theory which does not have a fixed particle content. This fact, which 

5 For states with more than one particle of the same momentum, this normalization will lead to divergent 
$\delta$-functions. However, any truly physical process will involve wavepackets rather than real momentum eigenstates. 
The averaging involved here will smear the $\delta$-function. 

stems directly from the change in wavefunction interpretation from innocent state vector to 
dynamical variable, represents a major shift in the way we view physics as a whole. 

The field $\phi$, known as the Klein-Gordon [95] field, is a scalar field because it is invariant 
under the action of the Lorentz group. Although this makes it a good starting place for a 
discussion on the quantization of field theories, its appearance in physical theories is rare at 
best.⁶ Physical theories involve spinor and vector fields almost exclusively. For this reason, 
we now turn to the Dirac field. 



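According to the presentation of Section B.3, Dirac fields satisfy the equation of motion 

$$\not{p}\,\psi(x) = i\not{\partial}\,\psi(x) = m\,\psi(x) . \tag{D.22}$$

This equation follows from the lagrangian density 

$$\mathcal{L} = \bar{\psi}\left(i\not{\partial} - m\right)\psi , \tag{D.23}$$

from which we can deduce the momentum conjugate to $\psi$: 

$$\frac{\delta\mathcal{L}}{\delta\,\partial_0\psi} = i\psi^\dagger . \tag{D.24}$$

Anticipating a field quantum for $\psi$, we use the complete basis of solutions to (D.22) found in 
Section B.3 to expand 

$$\psi(x) = \int\frac{d^4p}{(2\pi)^4}\sum_s\left[b_{\vec{p},s}\,u(p,s)\,e^{-ip\cdot x} + d^\dagger_{\vec{p},s}\,v(p,s)\,e^{ip\cdot x}\right]\sqrt{2p^0}\,\theta(p^0)\,(2\pi)\,\delta(p^2-m^2) . \tag{D.25}$$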

(D.25) 

We hope to identify the coefficients in this expansion with particle creation and annihilation 
operators. 

The Dirac field is not hermitian, so we have no right to assume a relation between the 
creation and annihilation operators. However, as $e^{-ip\cdot x}$ ($e^{+ip\cdot x}$) carries positive (negative) 
energy⁷ $p^0$ ($-p^0$), we associate it with an annihilation (creation) operator.⁸ With this in mind, 

6 The closest friend of the real Klein-Gordon field present in the Standard Model is the complex Higgs field. 
This is the only field in the Standard Model whose quanta have not yet (2000) been discovered experimentally 
(unless, of course, one counts the longitudinal modes of the massive gauge bosons...). 

7 Here, I mean energy in the sense of a wave, rather than energy in the sense of the Hamiltonian. 

8 When ip acts on a physical state, we want the creation of a particle to take free energy away from the 
system and the destruction of a particle to give free energy to the system. In this sense, a system must use its 
wavelike (free) energy to 'make' a particle (bound energy). 
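we assume the commutators 

$$ [\,b_{p,s},\, b^\dagger_{p',s'}\,] = \delta_{ss'}\,(2\pi)^3\,\delta(\vec p - \vec p\,') = [\,d_{p,s},\, d^\dagger_{p',s'}\,] , $$ (D.26) 

with all others vanishing^9 and check the canonical commutator 

$$ [\,\psi(x),\, \Pi_\psi(y)\,]_{x^0=y^0} = i\,\delta(\vec x - \vec y) . $$ (D.27) 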



Unfortunately, our choice (D.26) does not cause this relation to be satisfied.^10 The culprit 
is the lack of a relative sign between the positive- and negative-frequency components of 

$$ \Pi_\psi(x) = i\psi^\dagger(x) = i\int \frac{d^4p}{(2\pi)^4} \sum_s \left[ b^\dagger_{p,s}\, u^\dagger(p,s)\, e^{ip\cdot x} + d_{p,s}\, v^\dagger(p,s)\, e^{-ip\cdot x} \right] \sqrt{2p^0}\; \theta(p^0)\,(2\pi)\,\delta(p^2 - m^2) . $$ (D.28) 

This stems from the fact that there is only one derivative in the lagrangian density, (D.23), 
which is traced in turn to the form the translation operator takes in SL(2,C). Hence this 
seems to be a fundamental property of spinor fields. 

One solution to this problem is simply to generate the sign from the commutation relations. 
If we were to take 

$$ [\,d_{p,s},\, d^\dagger_{p',s'}\,] = -\delta_{ss'}\,(2\pi)^3\,\delta(\vec p - \vec p\,') $$ (D.29) 

rather than (D.26), the fields would indeed satisfy the commutation relation (D.27). However, 
this solution is exactly as artificial as it seems. It only prolongs our suffering, as it leads to a 
Hamiltonian that is not bounded from below. This fact is intuitively obvious. A commutation 
relation like (D.29) gives $d_{p,s}$ the role of a creation operator. Hence the creation of a d-type 
quantum gives free energy to a state. This implies that all of our states will eventually decay to 
the state of infinite d quanta. Our aversion to this state of affairs was what led us to consider 
the fields as dynamical variables in the first place, so this 'solution' to our problem is clearly 
unacceptable. 

Alternatively, we can demand that $d_{p,s}$ destroy quanta and search for other sources of the 
sign. To this end, we consider a state $|0\rangle$ that both $b$ and $d$ annihilate. We would also like 
$b^\dagger$ and $d^\dagger$ to create quanta, so we require that the state $d\,d^\dagger|0\rangle$ is proportional to $|0\rangle$. If the 

9 In the free theory, these are all uncoupled harmonic oscillators; operators from different oscillators should 
commute. 

10 One can check this directly using the completeness relations (B.125) and (B.f56). 
commutator $[d, d^\dagger] = 1$, then we arrive at the old result $d\,d^\dagger|0\rangle = |0\rangle$. However, the reason 
these two quantities are equal is that $d$ annihilates $|0\rangle$. In a fundamental sense, it doesn't 
matter whether we subtract $d^\dagger d$ to form the commutator or add it to form the anticommutator. 
On the other hand, this difference is essential to the cancellation of Dirac structures in the 
canonical commutator. If we choose instead to impose a canonical anticommutator, we will 
find that the relations 

$$ \{\,b_{p,s},\, b^\dagger_{p',s'}\,\} = \delta_{ss'}\,(2\pi)^3\,\delta(\vec p - \vec p\,') = \{\,d_{p,s},\, d^\dagger_{p',s'}\,\} , $$ (D.30) 

with all other anticommutators zero, satisfy all of our requirements without compromising the 
integrity of our vacuum. 
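
The argument above can be made concrete with a small sketch. The following Python fragment represents a single mode $d$ on the two-state space spanned by $|0\rangle$ and $d^\dagger|0\rangle$; the 2x2 matrix representation and the helper names are assumptions made purely for illustration and do not appear elsewhere in the text. It checks that $d\,d^\dagger|0\rangle = |0\rangle$, that the anticommutator of $d$ and $d^\dagger$ is the identity, and that $d^\dagger$ applied twice gives zero.

```python
# Minimal sketch: one fermionic mode on the basis (|0>, |1> = d_dag|0>).
# The 2x2 matrices and helper names are illustrative assumptions.

def matmul(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(a, b):
    """Add two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def apply(a, v):
    """Apply a 2x2 matrix to a 2-component vector."""
    return [sum(a[i][k] * v[k] for k in range(2)) for i in range(2)]

d = [[0, 1], [0, 0]]      # annihilation operator: d|1> = |0>, d|0> = 0
d_dag = [[0, 0], [1, 0]]  # creation operator: d_dag|0> = |1>
vacuum = [1, 0]

anticommutator = add(matmul(d, d_dag), matmul(d_dag, d))
dd_dag_on_vacuum = apply(matmul(d, d_dag), vacuum)
d_dag_squared = matmul(d_dag, d_dag)
```

In this representation the commutator of $d$ and $d^\dagger$ is diag(1, -1), so it agrees with the anticommutator only when acting on the vacuum, which is the point made in the text.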

The decision to impose canonical anticommutation relations on spinor fields is not ex- 
tremely outrageous. As we have seen, these fields are fundamentally different from scalar and 
vector fields. In fact, looking at the classical fields, we would actually expect this behavior. 
Classical vector fields satisfy 

A^g^ = B v g^ , (D.31) 

while for spinor fields we have 

fe^rf = -VW Q ■ (D.32) 

Hence in classical theories the vector degrees of freedom commute, while spinor degrees anti- 
commute. From this point of view, it seems very natural that quantization imposes a commu- 
tator on vectors and an anticommutator on spinors. This argument also implies that different 
spinor fields should anticommute rather than commute, exactly as we found above. 

Whatever the reason, the imposition of anticommutators on fermionic fields changes the 
theory drastically. In particular, the statement 

$$ \{\,b^\dagger_{p,s},\, b^\dagger_{p,s}\,\} = 0 $$ (D.33) 

implies that the creation operator annihilates the one-particle state. Spinor states are either 
filled or empty; there can be no multiple occupation. This idea was first postulated by Pauli 
as a way to explain the observed atomic spectra. In non-relativistic quantum mechanics, it is 
imposed on the theory as an experimental input. In quantum field theory, it is promoted to a 
consistency requirement. This is one of the major triumphs of relativistic quantum mechanics. 
Another is the prediction of antiparticles. Since the quanta of the $d$ operator are created by 
$\psi$ while those of the $b$ operator are destroyed, the absence of a 'b-on' is equivalent in some 
sense to the presence of a 'd-on'. This relationship is dubbed particle-antiparticle duality. By 
convention, we take the quanta destroyed by $\psi$ as the particles and those it creates as the 
antiparticles. Thus the quark field $\psi_q$ destroys quarks and creates antiquarks. Antiparticles 
were first predicted by P. A. M. Dirac |Q in 1931, and discovered in the form of the positron by 
C. D. Anderson in 1932 p7| . Dirac shared the 1933 Nobel Prize in Physics with E. Schrödinger 
for this advancement of the quantum theory, and Anderson won the Nobel Prize in 1936 for 
its experimental verification. 

The Hamiltonian of the (correctly quantized) free Dirac field can be guessed from (D.20); 
it is a weighted sum over the 'occupation number' operators $b^\dagger b$ and $d^\dagger d$. Our particle states 

$$ |p,s\rangle = \sqrt{2E_{\vec p}}\; b^\dagger_{p,s}|0\rangle $$ (D.34) 

are normalized such that 

$$ \langle p',s'|p,s\rangle = 2E_{\vec p}\;\delta_{ss'}\,(2\pi)^3\,\delta(\vec p\,' - \vec p) . $$ (D.35) 

Antiparticle states are normalized in the same way. Note that the action of our fields gives 

$$ \bar\psi(x)|0\rangle = \int \frac{d^3p}{(2\pi)^3\, 2E_{\vec p}} \sum_s \bar u(p,s)\, e^{ip\cdot x}\, |p,s\rangle , $$ (D.36) 
$$ \psi(x)|p,s\rangle = u(p,s)\, e^{-ip\cdot x}\, |0\rangle + \cdots , $$ (D.37) 

so $\bar\psi(x)$ creates a particle at spacetime point $x$ and $\psi$ changes the one-particle state into the 
vacuum while providing a wavefunction and some free energy. 

Canonical quantization of ordinary vector fields goes through in the same way as for scalars. 
The only difference is the existence of a polarization vector analogous to the wavefunction 
$u(p, s)$ for fermions. Gauge bosons, on the other hand, are considerably harder to quantize 
using the method presented above. In fact, quantization of non-abelian gauge bosons in covariant 
gauges requires the introduction of ghost fields, as discussed in Section 1.2. For these 
kinds of theories, it is much clearer to use the path integral approach presented there. 
D.2 Spacetime Symmetries - 

The Generators of the Poincare Group 



In this section, we would like to study the implications of Poincaré symmetry on a quantum 
field theory. In all of the following, we will assume that the action relevant to our theory is 
invariant under Poincaré transformations of the fields, i.e. 

$$ S[\Phi'] = S[\Phi] $$ (D.38) 

for any set of fields $\Phi'$ related to $\Phi$ by a Poincaré transformation. This is true of any theory 
whose lagrangian is a Lorentz scalar and does not depend explicitly on the spacetime 
coordinates $x^\mu$. 

In a classical theory, we expect such symmetries to lead to currents and charges which are 



conserved by physical processes. As mentioned in Section B.2, invariance under translation 
leads to conservation of momentum and that under Lorentz transformations leads to conserva- 
tion of angular momentum. Let us see how this comes about. To begin with, we must define 
what we mean by a physical process. Classically, a physical process is one which makes the 
action stationary. Under an arbitrary infinitesimal variation of the fields, the action changes 
by the amount^11 



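$$ \delta S = \int d^4x \left[ \frac{\delta\mathcal{L}}{\delta\Phi}\,\delta\Phi + \frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi}\,\delta\partial_\mu\Phi \right] $$ (D.39) 
$$ \phantom{\delta S} = \int d^4x \left[ \frac{\delta\mathcal{L}}{\delta\Phi}\,\delta\Phi + \frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi}\,\partial_\mu\delta\Phi \right] $$ (D.40) 
$$ \phantom{\delta S} = \int d^4x \left[ \frac{\delta\mathcal{L}}{\delta\Phi} - \partial_\mu\frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi} \right] \delta\Phi . $$ (D.41) 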



In the last line, I have ignored a surface term. This contribution does become important in 
certain theories, most notably gravitational theories and topological models, but it will not 

11 These manipulations can be thought of in the following way. We first consider the action as a function 
of the independent fields $\Phi$ and $\Phi_\mu$. No one can prevent us from varying these fields independently. However, 
in the end we are interested only in the class of fields that satisfy the constraint $\Phi_\mu = \partial_\mu\Phi$. If we make 
independent variations, this constraint will not be preserved. On the other hand, if we correlate the field 
variations by imposing $\delta\Phi_\mu = \partial_\mu\delta\Phi$, field configurations which satisfy $\Phi_\mu = \partial_\mu\Phi$ will continue to satisfy it 
after the transformation. In this way, we can differentiate the lagrangian with respect to the fields $\Phi$ and $\partial_\mu\Phi$ 
as though they were independent and still obtain the proper constrained variations. Of course, this procedure 
is equivalent to the more orthodox method of Lagrange multipliers. 
bother us here. From the last line, it is apparent that the requirement of a stationary action 
for arbitrary infinitesimal field variations implies the Euler-Lagrange equation 

$$ \frac{\delta\mathcal{L}}{\delta\Phi} - \partial_\mu\frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi} = 0 . $$ (D.42) 

This expression is known as the equation of motion of the theory. Each independent field in the 
theory has an associated equation of motion which is satisfied by all physical processes. 

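If we now specialize our variation to a translation of spacetime coordinates, we can work 
out the implications of translation invariance. Under a translation, we have the variations^12 

$$ \delta\Phi(x) = \epsilon^\nu(x)\,\partial_\nu\Phi(x) , \qquad \delta\partial_\mu\Phi(x) = \partial_\mu\!\left[ \epsilon^\nu(x)\,\partial_\nu\Phi(x) \right] . $$ (D.43) 

Substituting these expressions into (D.39), we see that $S$ changes by the amount 

$$ \delta S = \int d^4x \left[ \epsilon^\nu \left( \frac{\delta\mathcal{L}}{\delta\Phi}\,\partial_\nu\Phi + \frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi}\,\partial_\mu\partial_\nu\Phi \right) + \partial_\mu\epsilon^\nu\, \frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi}\,\partial_\nu\Phi \right] . $$ (D.44) 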



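The quantity multiplying $\epsilon^\nu$ combines to form $\partial_\nu\mathcal{L}$, as can be seen via the chain rule, so we 
are left^13 with 

$$ \delta S = \int d^4x\, \partial_\mu\epsilon^\nu(x) \left[ \frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi}\,\partial_\nu\Phi - g^\mu{}_\nu\,\mathcal{L} \right] . $$ (D.45) 

Calling the quantity in brackets $T^\mu{}_\nu$, we see that invariance of the action under translation 
implies that 

$$ \partial_\mu T^{\mu\nu} = 0 . $$ (D.46) 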

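One often refers to this expression as the 'conservation' of $T^{\mu\nu}$ since it implies that the vector 
$\int d^3x\, T^{0\nu}(x)$ does not depend on time : 

$$ \frac{d}{dt}\int d^3x\, T^{0\nu}(x) = \int d^3x\, \partial_0 T^{0\nu}(x) = -\int d^3x\, \partial_i T^{i\nu}(x) = -\oint dS_i\, T^{i\nu}(x) = 0 . $$ (D.47) 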







12 The fact that the Pauli-Ljubanskii vector commutes with the generators of translation implies that this 
operation will not mix different elements of the same representation of the Poincare group. In other words, we 
can translate each field independently. This will not be true of Lorentz rotations. 

13 after ignoring another surface term 
The first equality requires the volume of integration to be independent of time. In the second, 
we have used the fact that the divergence of $T$ is zero. The third is an application of the 
divergence theorem. The integration takes place on the boundary of the volume and $dS_i$ 
represents an outward unit normal. In the fourth, we have assumed that this surface integral 
vanishes. If it does not, Equation (D.47) states that the change in this quantity with time is 
due only to the value of $T^{i\nu}$ on the boundary. Physically, we can think of $T^{i\nu}$ as the 'flow' of a 
'charge density' $T^{0\nu}$. This interpretation makes it easy to understand (D.47). It says that the 
change in total charge in a volume is equal to the amount that came in across the boundary. 
This is conservation in its most fundamental form. Since the conserved 'charge' here is due to 
translation invariance, we will call it momentum. Thus $T^{0\nu}$ can be identified with a momentum 
density and $T^{i\nu}$ with a momentum flow. The integrated charge, $\mathcal{P}^\mu = \int d^3x\, T^{0\mu}(x)$, is the 
conserved total momentum of the system. For these reasons, $T$ is called the canonical energy-momentum 
tensor of our theory. 
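
The bookkeeping behind (D.47) can be illustrated on a one-dimensional grid. The sketch below is only a toy: the upwind update, the grid values, and every name in it are assumptions chosen for illustration rather than anything taken from the text. It evolves a 'charge density' by one time step and compares the change in the total charge with the net flow through the two boundary faces.

```python
# Minimal sketch of (D.47) on a 1D grid: the total "charge" in a region changes
# only through the flux across its boundary. The upwind scheme, grid values,
# and all names are illustrative assumptions, not taken from the text.

def step(density, inflow, speed, dx, dt):
    """Advance cell densities by one step; return (new_density, flux_in, flux_out)."""
    # flux[i] is the current through the left face of cell i; flux[-1] leaves the region.
    flux = [inflow] + [speed * rho for rho in density]
    new_density = [rho - dt / dx * (flux[i + 1] - flux[i])
                   for i, rho in enumerate(density)]
    return new_density, flux[0], flux[-1]

dx, dt, speed = 0.1, 0.01, 1.0
density = [0.0, 1.0, 2.0, 1.0, 0.5]
charge_before = sum(density) * dx
density, flux_in, flux_out = step(density, inflow=0.3, speed=speed, dx=dx, dt=dt)
charge_after = sum(density) * dx

# Change in total charge versus net inflow through the two boundary faces.
charge_change = charge_after - charge_before
net_inflow = dt * (flux_in - flux_out)
```

The interior fluxes cancel in pairs when the cells are summed, which is the discrete counterpart of the divergence theorem used in the third equality of (D.47).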

However nice this derivation is, it carries with it a few problems. First, it is not explicitly 
covariant. This problem stems from the fact that we must choose a frame before we ask if a 
quantity changes with time. However, looking at the derivation, we certainly never used any 
specific features of the frame we were in. In fact, the whole derivation can be performed with 
any timelike vector replacing the coordinate t. The statement then becomes 



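$$ \frac{d}{d\lambda} \int \epsilon_{\alpha\beta\gamma\delta}\, d\Sigma^{\alpha\beta\gamma}\, v^\delta\, v^*_\mu T^{\mu\nu} = -\int \epsilon_{\alpha\beta\gamma\delta}\, d\sigma^{\alpha\beta}\, v^\gamma\, T^{\delta\nu} , $$ (D.48) 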



where $d\lambda$ represents a change along the $v^\mu$ direction, $v^{*\mu}$ is a vector complementing $v$ such that 
$v^* \cdot v = 1$, $d\Sigma^{\alpha\beta\gamma}$ and $d\sigma^{\alpha\beta}$ represent volume and area integration measures,^14 respectively, and 
the integration region cannot change as one traverses the $v^\mu$ direction. The physical content 
of this equation is identical to that of (D.47). In fact, due to the covariance of the result, we 
can always choose a frame in which $v^* = v = (1,0,0,0)$ and use (D.47).^15 

A more fundamental difficulty with this tensor is that it is not always symmetric. It 
turns out that defining a conserved angular momentum density requires a symmetric energy- 

14 These quantities are best described in the language of 3- and 2-forms, respectively. In both cases, the 
orientation of integration is contained in their definition. Here, the orientation of the volume integration must 
also be specified. It is timelike. 

15 unless we choose lightlike, as is done in light-cone perturbation theory 1521 . 
momentum tensor. Non-relativistically, we would expect the angular momentum density to 
have the form 

$$ \vec L = \vec x \times \vec P . $$ (D.49) 

Generalizing this equation to four dimensions, we write 

$$ L^{\mu\nu} = x^\mu P^\nu - x^\nu P^\mu . $$ (D.50) 

Interpreting $P$ as a momentum density, we see that this is just the time component of the 
tensor 

$$ M^{\alpha\mu\nu} = x^\mu T^{\alpha\nu} - x^\nu T^{\alpha\mu} . $$ (D.51) 

Taking the divergence of this object on $\alpha$, we find that 

$$ \partial_\alpha M^{\alpha\mu\nu} = T^{\mu\nu} - T^{\nu\mu} , $$ (D.52) 

effectively ending all of our dreams of a conserved angular momentum. 

This is not necessarily a failure. We have not yet told our calculations that the lagrangian 
for our theory is Lorentz invariant, and we cannot expect to get angular momentum conservation 
for free. If $T^{\mu\nu}$ were manifestly symmetric, we would have the result that translationally 
invariant theories are automatically Lorentz invariant and possess conserved angular momenta. 
Lorentz invariant scalar theories do indeed possess symmetric canonical energy-momentum 
tensors. This is due to the simple fact that no antisymmetric two-index structure can be constructed 
in these theories.^16 However, somewhat more input is required in theories involving 
fields with nonzero intrinsic angular momentum. 

The biggest problem with $T^{\mu\nu}$ occurs in gauge theories like QCD: it is not gauge invariant. 
This fact destroys any hope we ever had of interpreting any of its components as physical 
objects. $T$ may be conserved in a gauge theory, but it has no direct physical meaning. It seems that this 
devastating blow renders all of our work useless. However, all is not lost. Above, we argued 
that the canonical momentum density cannot be used to locally construct a conserved angular 
momentum density for fields with nontrivial spin. We have also seen that imposing local 
gauge invariance requires the introduction of a gauge field, which necessarily transforms like a 
16 The structure $\partial^\mu\phi\,\partial^\nu\chi$ can always be transformed away by defining new fields. 



vector^17 since it is on the same footing as $\partial_\mu$. Hence the inconsistency of the canonical energy-momentum 
tensor in any gauge theory is already expected on angular momentum grounds. 
This suggests that the gauge-dependent part of $T$ is related to its antisymmetric part. 

If our theory truly is Lorentz invariant, we must be able to define a conserved angular mo- 
mentum. There are several ways to do this. A straightforward approach is simply to substitute 



the infinitesimal form of a spacetime dependent Lorentz transformation into Eq.(D.39) and 
follow one's nose. This will certainly lead to a conserved total angular momentum, but the 
connection between angular momentum and momentum is completely obscured. As we saw 
above, any local correspondence between these two quantities is doomed unless the canonical 
energy-momentum tensor is symmetric. Furthermore, local angular momentum densities and 
currents defined in this way will not be gauge-invariant and therefore cannot have a physi- 
cal interpretation. Alternatively, if we can somehow define a 'new and improved' symmetric 
energy-momentum tensor, we can make the connection between momentum and angular mo- 
mentum explicit by constructing the angular momentum density locally. We will see that this 
construction also leads naturally to gauge-invariant local densities. 

Looking back on our argument that the total momentum is conserved, we see that the only 
crucial property of $T$ is that its divergence equals zero. The addition of any divergence-free 
tensor to $T$ will yield another tensor from which conserved quantities can be derived. However, 
this argument does not show that the new quantities are the momenta of the system. The 
conservation of $\mathcal{P}$ is a direct consequence of translation invariance, as we saw above. This more 
than anything else promotes its identification with the momentum of the system. Alternative 
tensors may change the momentum density, $T^{0\mu}$, allowing for a conserved angular momentum 
density, but must not change the integrated quantity $\mathcal{P}$. Both of these requirements are 
satisfied by the Belinfante transformation, 

$$ T^{\mu\nu} \to T^{\mu\nu} + \partial_\alpha \chi^{[\alpha\mu]\nu} . $$ (D.53) 

The antisymmetry of $\chi$ under $\mu\alpha$ exchange guarantees the new tensor to be divergence-free. 

17 This is true for local internal symmetries. Gauge fields associated with external symmetries may have 
different transformation properties. For example, the gauge field associated with local Lorentz invariance 
transforms like a traceless rank-2 symmetric tensor. This causes the associated gauge boson, the graviton, to 
have spin-2. 
Furthermore, the difference between the new conserved quantities and $\mathcal{P}$ is just the surface 
integral 

$$ \int d^3x\, \partial_\alpha \chi^{[\alpha 0]\nu} = \int d^3x\, \partial_i \chi^{[i0]\nu} = \oint dS_i\, \chi^{[i0]\nu} , $$ (D.54) 

which can certainly be chosen to vanish over a large enough region.^18 If there exists a $\chi$ such 
that the remaining tensor is symmetric, this tensor can be used to define angular momentum 
densities whose integrals represent the conserved total angular momentum of the system. Let 
us see how the assumption of Lorentz invariance leads to the existence of such a $\chi$. 

Under the action of the Lorentz group, a scalar lagrangian transforms only because its 
argument does : 

C(x) -> C(Ax) = C(x) + X - (6^r u ) a p x P d a C(x) + 0(6 2 ) . (D.55) 

This is a translation of the same form as that considered above. Since we have already taken 
the translation invariance of the action into account, we will ignore the change in spatial 
coordinate and focus on the other transformations induced by the Lorentz group. 

The rotation generators do not commute with the Pauli-Ljubanskii vector, so they will in 
general mix different elements of irreducible representations of the Poincaré group; under an 
infinitesimal global Lorentz transformation, our fields transform as^19 



d a $ m -> SA-J^rL^-J^X 3 "^ ( D - 56 ) 



2 

2 k^iiv i a p " ^ m ~ 2 

Here, $m$ and $n$ label different fields in our theory. $\Sigma$ will depend on the representation the 
fields belong to (or, equivalently, the spin) and $J$ is the generator of coordinate rotations, 

(•Oa/j ■ (D-57) 

Under this transformation, the lagrangian itself must be invariant since we are ignoring its 
variation due to the coordinate translation. Hence Lorentz invariance implies 

n - i—ypyg* 1 — y^f) <b 1 — ( T^) F)P<& 



18 barring topological effects... 

19 Note that the index on the derivative is treated like a vector index here. 
T^^--^ 9 ^ (D.58) 
\5d a $ 



where in the last line I have used the equation of motion, (D.42), and the fact that $\Sigma^{\mu\nu}$ does 
not depend on $x^\mu$. 

This equation is rather interesting. Rather than giving us a divergence-free current from 
which we can construct a conserved angular momentum density, Lorentz invariance relates the 
antisymmetric part of T to a total divergence. We can use this to immediately construct the 
canonical angular momentum current density 

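$$ M^{\alpha\mu\nu} = x^\mu T^{\alpha\nu} - x^\nu T^{\alpha\mu} - i\,\frac{\delta\mathcal{L}}{\delta\partial_\alpha\Phi}\,\Sigma^{\mu\nu}\Phi , $$ (D.59) 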

which follows directly from Lorentz invariance. As is usually the case with canonical quantities, 
this object is easy to interpret : the first two terms are due to orbital angular momentum while 
the last is due entirely to intrinsic spin angular momentum. Unfortunately, M also displays 
one of the less favorable attributes of canonical quantities - it is gauge-dependent. 
Alternatively, notice that the symmetric part of $T$ can be written as 

$$ T^{(\mu\nu)} = T^{\mu\nu} - \frac{i}{2}\,\partial_\alpha\!\left( \frac{\delta\mathcal{L}}{\delta\partial_\alpha\Phi}\,\Sigma^{\mu\nu}\Phi \right) . $$ (D.60) 

This is reminiscent of a Belinfante transformation, (D.53), but our version of $\chi$ is not antisymmetric 
in $\alpha\mu$. We can fix this by subtracting a term in which $\alpha \leftrightarrow \mu$, but the tensor we are left 
with is no longer symmetric. Fortunately, the term we need to add to restore the symmetry is 
automatically antisymmetric in $\mu\alpha$ due to the antisymmetry of $\Sigma$; the tensor 

SC 



5d a $ 

(D.61) 



is both symmetric and physically indistinguishable from T. In gauge theories, we will also find 
it to be gauge-invariant. This is the tensor which can truly be interpreted as the momentum 
current density of our system. According to the above discussion, we can use to form the 
angular momentum current density 
This improved Belinfante form differs from $M$ locally, but one can show by direct substitution 
that the conserved total angular momentum 

$$ J^{\mu\nu} = \int d^3x\, M^{0\mu\nu}(x) $$ (D.63) 

is unaffected.^20 

While the above discussion is perfectly fine in classical theories, some modification is clearly 
needed before these results can be applied to quantized fields. In the path-integral approach, 
the transition is quite clean. We begin with the vacuum matrix element 

$$ \langle 0|\,T\,O(x_1,\ldots,x_n)\,|0\rangle = \frac{1}{Z[0]} \int \mathcal{D}[\Phi]\, O(x_1,\ldots,x_n)\, e^{iS[\Phi]} , $$ (D.64) 

where $O$ is an arbitrary^21 collection of field operators depending on the spacetime coordinates 
$x_1,\ldots,x_n$. Let us ignore the normalization factor $Z[0]$ for the moment and focus on the path 
integral. Since $\Phi$ is nothing but a dummy variable of integration, it is clear that 



$$ \int \mathcal{D}[\Phi]\, O(x_1,\ldots,x_n)\, e^{iS[\Phi]} = \int \mathcal{D}[\Phi']\, O'(x_1,\ldots,x_n)\, e^{iS[\Phi']} , $$ (D.65) 

where $O'$ is the functional $O$ evaluated at the field configuration $\Phi'$. Taking $\Phi'(x) = \Phi(x) + \delta\Phi(x)$, 
with $\delta\Phi$ infinitesimal but otherwise unconstrained, we rewrite (D.65) as^22 

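$$ \int \mathcal{D}[\Phi]\, O(x_1,\ldots,x_n)\, e^{iS[\Phi]} = \int \mathcal{D}[\Phi]\, \det\!\left( \frac{\delta\Phi'(x)}{\delta\Phi(y)} \right) \left[ O(x_1,\ldots,x_n) + \frac{\delta O(x_1,\ldots,x_n)}{\delta\Phi(x)}\,\delta\Phi(x) + i\,\frac{\delta S[\Phi]}{\delta\Phi(x)}\,\delta\Phi(x)\, O(x_1,\ldots,x_n) \right] e^{iS[\Phi]} . $$ (D.66) 

Let us consider the arbitrary variation 

$$ \delta\Phi(x) = \epsilon(x) . $$ (D.67) 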



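In this case, our variation is merely a field-independent shift of the integration variable. Hence 
the Jacobian is trivial. This immediately leads to the relation 

$$ \int \mathcal{D}[\Phi] \left[ \frac{\delta O(x_1,\ldots,x_n)}{\delta\Phi(x)}\,\delta\Phi(x) + i\,\frac{\delta S[\Phi]}{\delta\Phi(x)}\,\delta\Phi(x)\, O(x_1,\ldots,x_n) \right] e^{iS[\Phi]} = 0 . $$ (D.68) 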



20 once again, ignoring surface terms. 

21 In a gauge theory, we will require it to be gauge invariant. 



22 Recall that repeated indices imply summation. Here, the repeated 'index' is the spacetime variable x. Also 
note that the determinant is inverted for Grassmann fields. 
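Writing 

$$ \frac{\delta S[\Phi]}{\delta\Phi(x)}\,\delta\Phi(x) = \int d^4x \left[ \frac{\delta\mathcal{L}(x)}{\delta\Phi(x)} - \partial_\mu\frac{\delta\mathcal{L}(x)}{\delta\partial_\mu\Phi(x)} \right] \epsilon(x) $$ (D.69) 

and asserting the arbitrariness of $\epsilon(x)$, we see that 

$$ \left\langle T \left[ \frac{\delta\mathcal{L}(x)}{\delta\Phi(x)} - \partial_\mu\frac{\delta\mathcal{L}(x)}{\delta\partial_\mu\Phi(x)} \right] O(x_1,\ldots,x_n) \right\rangle = i \left\langle T\, \frac{\delta O(x_1,\ldots,x_n)}{\delta\Phi(x)} \right\rangle . $$ (D.70) 

The right-hand side of this equation is a series of contact terms which involve $\delta(x - x_i)$ for 
some $i \in \{1, \ldots, n\}$, while the left-hand side is a Green's function involving the equation of motion 
operator 

$$ \frac{\delta\mathcal{L}}{\delta\Phi} - \partial_\mu\frac{\delta\mathcal{L}}{\delta\partial_\mu\Phi} . $$ (D.71) 

Equation (D.70) is an extremely useful relation. In particular, it tells us that all physical 
matrix elements of (D.71) vanish since in that case the operator $O$ represents asymptotic 
fields and will have arguments only at infinity. 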
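
The shift-invariance argument leading to (D.70) has an elementary analogue for an ordinary integral over a single variable, which can be checked numerically. The sketch below makes several assumptions for illustration only: a Euclidean weight exp(-S) replaces exp(iS), the 'action' is S = phi^2/2 + lam*phi^4/4, and the inserted operator is O = phi. Under those assumptions the identity reads <S'(phi) O(phi)> = <O'(phi)>, with the right-hand side playing the role of the contact term.

```python
import math

# Toy check of the identity behind (D.70) for an ordinary one-variable integral.
# Assumptions for illustration: a Euclidean weight exp(-S) replaces exp(iS),
# S(phi) = phi^2/2 + lam*phi^4/4, and the operator is O(phi) = phi.

def expectation(func, lam, cutoff=10.0, steps=40000):
    """Average func(phi) with weight exp(-S(phi)) using the trapezoid rule."""
    h = 2.0 * cutoff / steps
    num = den = 0.0
    for k in range(steps + 1):
        phi = -cutoff + k * h
        w = math.exp(-(0.5 * phi**2 + 0.25 * lam * phi**4))
        if k in (0, steps):
            w *= 0.5  # trapezoid end-point weights
        num += w * func(phi)
        den += w
    return num / den

lam = 0.7
# Left side: <S'(phi) * O(phi)> with S'(phi) = phi + lam*phi^3 and O = phi.
lhs = expectation(lambda phi: (phi + lam * phi**3) * phi, lam)
# Right side: <O'(phi)> = <1>, the zero-dimensional analogue of a contact term.
rhs = expectation(lambda phi: 1.0, lam)
```

The two sides agree because the integral of a total derivative of O*exp(-S) vanishes, which is the one-variable version of the trivial Jacobian under a field-independent shift.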

At this point, it is obvious how to proceed with other variations of the fields $\Phi$. All we 
need is a transformation which leaves the measure of the path integral invariant^23 to arrive at 
an identity of the form 

$$ \left\langle T\{ C(x)\, O(x_1,\ldots,x_n) \} \right\rangle = i \left\langle T \left\{ \frac{\delta O(x_1,\ldots,x_n)}{\delta\Phi(x)} \right\} \right\rangle , $$ (D.72) 

where $C$ is a collection of fields which is classically zero. In particular, the classical relations 
involving conservation of energy-momentum and angular momentum can easily be cast into 
this form since the transformations of translation and Lorentz rotation leave the measure 
invariant.^24 

The lagrangian density of massless QCD is 

1 



from which we derive^ 



(D.73) 



(D.75) 



23 Actually, useful identities can be derived even when the measure is not left invariant. One extremely 
well-known example is the derivation of the Adler-Bell-Jackiw anomaly pi) . 

24 For fermions, we can see this as a consequence of the invariance of the product ipip, while for vector bosons 



B.l 



it is due to the fact that det A = rfc 1 for the Lorentz transformations introduced in Section 

25 These relations are not unique. In particular, the replacement iT>—>iT> will affect the first four. However, 
SC 
5C 
5C 
SL 



= i>i^ (D.76) 

= ilp^ (D.77) 

= (D.78) 

= - g ^t a ^-gf ahc ^ a A c a (D.79) 

= -F% v ■ (D.80) 



These relations, along with the Lorentz transformation properties 

= (D-81) 
E£ u = -iJ" v , (D.82) 

contain the QCD equations of motion, energy-momentum tensor, angular momentum current 
density, and any other symmetry currents of QCD. Some useful relations are summarized in 
Section A.4. 



One very useful identity that can easily be proved at this point is the fundamental Ward 
identity of QCD. Taking 

$$ \delta A^\mu_a(x) = \partial^\mu \epsilon_a(x) , $$ (D.83) 

the equations of motion and physical tensor densities are not affected by this ambiguity. We also note that 
the above derivation assumes commuting, rather than Grassmann, fields. To generalize our expressions, it is 
sufficient to consider Grassmann derivatives as coming from the direction of the variation. For example, 

<5$ Sip 

C^-Si, . (D.74) 
dtp 

26 unrenormalized and un-fixed 
(D.72) becomes^27 

The expression in brackets is constructed to be the current the gluon field couples to, so 
this equation tells us that Green's functions involving a gluon current contracted with its 
momentum are reduced to contact terms. In particular, it implies transversality of the gluon 
propagator and conservation of the color charge. In this sense, the terms on the right-hand- 
side act as sources for the color current. Nonzero contact terms imply a color source, which 
will affect the total charge. Note that this identity is also obvious from the classical equation 
of motion of the gluon field, 

Vtfjf = g^ a t a ^ , (D.85) 

and the antisymmetry of the field strength. 

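An analogous identity implies the conservation of quark flavor (or, more generally, baryon 
and lepton number). Taking 

$$ \delta\psi(x) = i\alpha(x)\,\psi(x) , \qquad \delta\bar\psi(x) = -i\alpha(x)\,\bar\psi(x) $$ (D.86) 

gives^28 

$$ \left\langle T \left\{ \partial_\mu\!\left[ \bar\psi(x)\gamma^\mu\psi(x) \right] O(x_1,\ldots,x_n) \right\} \right\rangle = \left\langle T \left\{ \bar\psi(x)\, \frac{\delta O(x_1,\ldots,x_n)}{\delta\bar\psi(x)} - \frac{\delta O(x_1,\ldots,x_n)}{\delta\psi(x)}\, \psi(x) \right\} \right\rangle . $$ (D.87) 

This relation also finds use as the fundamental Ward identity of QED. 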

These identities have all been derived ignoring the necessary steps of renormalization and 
gauge-fixing. The fact that these unrenormalized and un-fixed Ward identities imply similar 
relations among the fully renormalized gauge-fixed Green's functions is nontrivial to show. 
Nonetheless, it has been done in the covariant gauges and the light-cone gauge [J44J. 

27 Derivatives always belong outside the matrix element. The easiest way to see this involves the sources in the 
path integral approach. Since our sources bring down powers of the fields rather than their derivatives, we must 
differentiate externally. Among other things, this implies that derivatives will always generate contributions 
from the time-ordered product. These form the contact terms on the right-hand-side of our matrix element 
identities. As mentioned in Section 2.5, these contact terms can systematically be removed by considering the 
modified T* product. See [pj| . 

28 Once again, we must take care with these anticommuting derivatives. 
Once these renormalized and gauge-fixed Ward identities are proven, the renormalizability 
of the whole theory is just a step away. Thanks to these identities, all of the ultraviolet 
divergences present in the theory are constrained to take forms which respect our symmetries. 
Since we have chosen the most general lagrangian consistent with the symmetry, this implies 
that all ultraviolet divergences can be absorbed into counterterms, which in turn implies 
renormalizability. 
Appendix E 



Dimensional Regularization 



This Appendix is meant to familiarize the reader with the standard regularization procedure 
used in quantum field theory. Introduced by G. 't Hooft and M. Veltman in 1972 jHj, this 
procedure exploits the analytic behavior of integrals as functions of their dimensionality. Since 
the ultraviolet (and infrared) divergences we meet in the text are in some sense accidents of the 
fact that we live in four dimensions, these integrals prove entirely convergent when evaluated 
in fewer (or more) spacetime dimensions. The subtleties associated with the actual meaning of 
a 3.5-dimensional space are completely avoided here; some discussion can be found in |4C[ |. We 
will view dimensional regularization exclusively as a consistent regularization procedure for our 
amplitudes. 

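It can be shown quite generally that the coordinates 

$$ x_1 = \rho\cos\theta_1 $$ 
$$ x_2 = \rho\sin\theta_1\cos\theta_2 $$ 
$$ x_3 = \rho\sin\theta_1\sin\theta_2\cos\theta_3 $$ 
$$ \vdots $$ 
$$ x_i = \rho \prod_{j=1}^{i-1} \sin\theta_j\, \cos\theta_i $$ (E.1) 
$$ \vdots $$ 
$$ x_{(n-1)} = \rho \prod_{j=1}^{n-2} \sin\theta_j\, \cos\theta_{(n-1)} $$ 
$$ x_n = \rho \prod_{j=1}^{n-1} \sin\theta_j $$ (E.2) 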
span a Euclidean n-dimensional space for integer $n > 1$. The angles $\theta_i$ all take values from 
0 to $\pi$ except $\theta_{(n-1)}$, which takes values on $[0, 2\pi)$. The 2-norm of the vector $\mathbf{x}$ is given by $\rho$ 
and the $(n-1)$ angles specify a location on the $(n-1)$-dimensional sphere defined by $\rho$. The 
line element in these coordinates 



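$$ ds^2 = d\rho^2 + \rho^2\, d\theta_1^2 + \rho^2 \sum_{i=2}^{n-1} \left( \prod_{j=1}^{i-1} \sin^2\theta_j \right) d\theta_i^2 $$ (E.3) 

defines the volume element 

$$ d\mathbf{x} = \sqrt{\det g_{ij}}\; d\rho \prod_{i=1}^{n-1} d\theta_i = \rho^{n-1} \left( \prod_{i=1}^{n-1} \sin^{n-i-1}\theta_i \right) d\rho \prod_{j=1}^{n-1} d\theta_j . $$ (E.4) 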

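The hyper-area of our $(n-1)$-dimensional sphere then becomes 

$$ \Omega_{(n-1)} = 2\pi \prod_{i=1}^{n-2} \int_0^\pi \sin^{n-i-1}\theta_i\, d\theta_i , $$ (E.5) 

which upon using 

$$ \int_0^\pi \sin^n\theta\, d\theta = 2^n\, \frac{\Gamma^2\!\left(\frac{n}{2}+\frac{1}{2}\right)}{\Gamma(n+1)} = \sqrt{\pi}\, \frac{\Gamma\!\left(\frac{n}{2}+\frac{1}{2}\right)}{\Gamma\!\left(\frac{n}{2}+1\right)} $$ (E.6) 

reduces to 

$$ \Omega_{(n-1)} = \frac{2\pi^{n/2}}{\Gamma\!\left(\frac{n}{2}\right)} . $$ (E.7) 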
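For any function $f(\mathbf{x})$ that depends only on the magnitude $\rho$ of $\mathbf{x}$ we have 

$$ \int d\mathbf{x}\, f(\mathbf{x}) = \Omega_{(n-1)} \int_0^\infty d\rho\, \rho^{n-1} f(\rho) . $$ (E.8) 

This expression is correct for any positive integer dimension $n$. Furthermore, these integrals 
respect shift invariance 

$$ \int d\mathbf{x}\, f(\mathbf{x} + \mathbf{y}) = \int d\mathbf{x}\, f(\mathbf{x}) , $$ (E.9) 

for any constant vector $\mathbf{y}$, and the dilation identity 

$$ \int d\mathbf{x}\, f(a\mathbf{x}) = \frac{1}{|a|^n} \int d\mathbf{x}\, f(\mathbf{x}) , $$ (E.10) 

where $a$ is an arbitrary real constant. 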
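We can define the functional 

$$ \mathcal{I}_d[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) $$ (E.11) 

as a meromorphic function of $d$ that satisfies 

$$ \mathcal{I}_d[f](\mathbf{y}_1 + \mathbf{y},\ldots,\mathbf{y}_N + \mathbf{y}) = \mathcal{I}_d[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) $$ (E.12) 
$$ \mathcal{I}_d[f](a\mathbf{y}_1,\ldots,a\mathbf{y}_N) = |a|^d\, a^{\phi_f}\, \mathcal{I}_d[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) $$ (E.13) 

and takes the value 

$$ \mathcal{I}_n[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) = \int d\mathbf{x}\, f(\mathbf{x},\mathbf{y}_1,\ldots,\mathbf{y}_N) $$ (E.14) 

for $n \in \mathbb{Z}_+$.^1 The constant $\phi_f$ is a scale factor which depends only on $f$. This does not fix 
our functional uniquely. One can add any analytic function of $d$ which vanishes at all positive 
integers to $\mathcal{I}_d$ to obtain an equally acceptable functional. This ambiguity will be resolved 
shortly. 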

The vectors { y, } define at most N directions, so in M > TV integer dimensions Im contains 
at most N directionally-dependent integrals. Doing these integrals explicitly gives 

Zjtf[/](yi,---,yAr) =1m-n [2jv[/]] (yi,...,yj\r) , (E.15) 

where the remaining (M — N) integrals are direction independent, i.e. of the form of (|E.8| ). 



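If $\mathcal{I}_d$ is to be a meromorphic function of $d$ constrained by (E.14), it must also satisfy this
reduction identity:

\[ \mathcal{I}_d[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) = \mathcal{I}_{d-N}\big[\mathcal{I}_N[f]\big](\mathbf{y}_1,\ldots,\mathbf{y}_N) \; . \tag{E.16} \]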

This immediately implies that we need not consider directionally-dependent functions $f$. These
can always be reduced to directionally-independent functions by doing an integer number of
integrations.²

With this in mind, we can use (E.8) as a guideline to define a unique functional. For any
given directionally-independent function $f(\rho)$, we define

\[ \mathcal{I}_d[f] = \frac{2\pi^{d/2}}{\Gamma(d/2)} \int_0^\infty d\rho\, \rho^{d-1} f(\rho) \; , \tag{E.17} \]

where the integral on the right is to be evaluated in a suitable region³ of the complex plane of
$d$ and analytically continued elsewhere.

1 Unless this integral is not well-defined. We will see that these cases are handled quite nicely by the other
properties of $\mathcal{I}_d$.

2 Although this is not always the most practical way to extract the value.

3 This is taken to mean anywhere the integral is well-defined.

Since it has been constructed to look like an integral, one often uses the notation

\[ \mathcal{I}_d[f](\mathbf{y}_1,\ldots,\mathbf{y}_N) = \int d^d x\, f(\mathbf{x},\mathbf{y}_1,\ldots,\mathbf{y}_N) \; , \tag{E.18} \]

but it must always be understood that this is simply a convenient way to write things. In 
this form, the shift and dilation invariance imply that one can manipulate the variable of 
integration as in a normal integral. However, nonlinear changes of variables require care; the 
interested reader is referred to Q. 

Now that we know what a '$d$-dimensional integral' means, let's compute some. First and
foremost, any functional $\mathcal{I}_d[f]$ whose arguments $\{\mathbf{y}_i\}$ are all zero is identically zero.⁴ This
follows from the dilation property,

\[ \mathcal{I}_d[f](\mathbf{0},\ldots,\mathbf{0}) = |a|^d a^{\phi_f}\, \mathcal{I}_d[f](\mathbf{0},\ldots,\mathbf{0}) \; , \tag{E.19} \]



and the analytic structure of $\mathcal{I}_d$ as a function of $d$. This result is somewhat striking since it
implies that the integrals

\[ \int d^d x\, (\rho^2)^n \tag{E.20} \]

all take the value zero. However, as a consequence of (E.13), it is unalterable.⁵ Using (E.18)
in conjunction with (E.12), one can also easily show that

\[ \mathcal{I}_d[f](\mathbf{y},\ldots,\mathbf{y}) = 0 \; . \tag{E.21} \]

Apparently, $\mathcal{I}_d$ must have two vectors on which to depend if it is to be nonzero.⁶

4 We need all scalar arguments to be zero as well; see below.

5 Integrals in positive integer dimensions do not satisfy this equation. As a consequence, they are ill-defined.

6 Note that these vectors can be taken as $\mathbf{y}$ and $\mathbf{0}$ if one wishes.

Throughout the above, I have ignored the dependence of $\mathcal{I}_d$ on scalar quantities. This
dependence must, of course, be taken into account if one is to have a general functional. In
particular, the direction-independent integrals we have asserted to be the only necessary
consideration depend exclusively on scalar parameters. Since scalar quantities are not affected by
shifts in the integration variable, there is no analogue of (E.12) for these quantities. However,
these parameters are affected by dilation:

\[ \mathcal{I}_d[f](a\lambda_1,\ldots,a\lambda_N) = |a|^d a^{\phi_f}\, \mathcal{I}_d[f](\lambda_1,\ldots,\lambda_N) \; . \tag{E.22} \]

This relation again implies that parameterless integrals are necessarily zero. However, the lack
of shift invariance means that we only need one parameter to obtain a finite result. In quantum
field theory, we will find that the only truly $d$-dimensional integral one needs to compute is

\[ \int \frac{d^d k}{(2\pi)^d}\, \frac{(k^2)^\alpha}{(k^2+M^2)^\beta} = \frac{2}{(4\pi)^{d/2}\,\Gamma(d/2)} \int_0^\infty dk\, \frac{k^{d-1+2\alpha}}{(k^2+M^2)^\beta} \; . \tag{E.23} \]

For sufficiently small odd $d$ and integer $\alpha$, $\beta$, one can extend the integration region to negative
infinity and take the integral by contour. If we are very strong, we can show that this gives⁷

\[ \frac{1}{(4\pi)^{d/2}}\, (M^2)^{d/2+\alpha-\beta}\, \frac{\Gamma(d/2+\alpha)\,\Gamma(\beta-\alpha-d/2)}{\Gamma(d/2)\,\Gamma(\beta)} \; . \tag{E.24} \]

Analytically continuing this result to the full complex plane of $\alpha$, $\beta$, and $d$ gives the required
general integral.
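As a numerical sanity check of (E.23) and (E.24), the Python sketch below compares the closed form with a direct evaluation of the radial integral at parameter values where both converge ($d + 2\alpha > 0$ and $2\beta > d + 2\alpha$). The function names, the substitution $k = M\tan\theta$ and the midpoint rule are illustrative assumptions and not part of the original derivation.

```python
import math

def master_closed_form(d, alpha, beta, mass):
    """Closed form (E.24) for the integral of (k^2)^alpha / (k^2 + M^2)^beta."""
    prefactor = (4.0 * math.pi) ** (-d / 2) * (mass ** 2) ** (d / 2 + alpha - beta)
    gammas = math.gamma(d / 2 + alpha) * math.gamma(beta - alpha - d / 2)
    return prefactor * gammas / (math.gamma(d / 2) * math.gamma(beta))

def master_radial(d, alpha, beta, mass, steps=20000):
    """Radial form (E.23), evaluated after substituting k = M tan(theta)."""
    p = d - 1 + 2 * alpha                      # power of k in the numerator
    h = (math.pi / 2) / steps
    total = 0.0
    for j in range(steps):
        theta = (j + 0.5) * h                  # midpoint of the j-th subinterval
        total += math.sin(theta) ** p * math.cos(theta) ** (2 * beta - 2 - p)
    radial = mass ** (p + 1 - 2 * beta) * total * h
    angular = 2.0 / ((4.0 * math.pi) ** (d / 2) * math.gamma(d / 2))
    return angular * radial
```

For example, at $d = 3$, $\alpha = 0$, $\beta = 2$ both functions should return $1/(8\pi M)$, the familiar three-dimensional result.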

Dimensional regularization is a procedure in which one systematically replaces all four- 
dimensional loop-momentum integrals by d-dimensional ones. In this way, the ultraviolet 
divergences present in our theory are represented by poles at d = 4. As mentioned above,
this process is viewed exclusively as a way to exhibit ultraviolet (and infrared) divergences
explicitly as well-defined quantities. From this perspective, one would expect no changes in
the theory to be necessary. Unfortunately, small things have a way of infiltrating every aspect
of a quantum field theory. Interpreting a loop-momentum integral as d-dimensional implies
in turn that the loop-momenta are themselves d-dimensional, which implies that the metric 
used to form invariant products between them is d-dimensional. This fact changes the whole 
structure of the spacetime symmetry group of our theory. 
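To see how an ultraviolet divergence shows up as a pole at $d = 4$, one can evaluate (E.24) at $\alpha = 0$, $\beta = 2$, where it reduces to $(4\pi)^{-d/2}(M^2)^{d/2-2}\,\Gamma(2-d/2)$, and compare it with the leading terms of its expansion about $d = 4 - \epsilon$. The sketch below does this in Python; the function names and the choice of $\alpha$, $\beta$ are assumptions made for illustration only.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant gamma_E

def log_divergent_integral(d, mass):
    """(E.24) at alpha = 0, beta = 2: (4 pi)^(-d/2) (M^2)^(d/2 - 2) Gamma(2 - d/2)."""
    return (4.0 * math.pi) ** (-d / 2) * (mass ** 2) ** (d / 2 - 2) * math.gamma(2 - d / 2)

def pole_expansion(eps, mass):
    """Leading terms at d = 4 - eps: [2/eps - gamma_E + log(4 pi) - log(M^2)] / (16 pi^2)."""
    bracket = 2.0 / eps - EULER_GAMMA + math.log(4.0 * math.pi) - math.log(mass ** 2)
    return bracket / (16.0 * math.pi ** 2)
```

The two functions differ only by terms of order $\epsilon$, and the $2/\epsilon$ term is the pole that represents the logarithmic ultraviolet divergence of the four-dimensional integral.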



Looking back at Section B.1, we see that things are not so bad. Almost all of the constructs
of that section can be accommodated in $d$ dimensions.⁸ We have to write

\[ g_{\mu\nu}\, g^{\mu\nu} = d \tag{E.25} \]

rather than four and we find ourselves with $d(d-1)/2$ rotation generators rather than six.
We can easily keep track of such factors. The only change in the above formalism is that $f$
must now be allowed to depend on $d$, which we would've had to consider at higher orders
anyway.⁹ These modifications will affect physical amplitudes when multiplied by poles at
$d = 4$. Renormalized quantities are guaranteed to be finite, but the divergences are certainly
not guaranteed to leave no trace.

7 A much more elegant derivation is given in [40].

8 To minimize the change in structure to our group, we add only spacelike coordinates.

The intimate relation between $SO(3,1)$ and $SL(2,\mathbb{C})$ requires us to continue our Dirac
space into $d$ dimensions as well. The smallest dimensionality in which we can find $d$ $\gamma$-matrices
which satisfy the Clifford algebra

\[ \{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu} \tag{E.27} \]

is $2^{d/2} \times 2^{d/2}$, so we should modify the normalizations

\[ \gamma^\mu \gamma_\mu = d \tag{E.28} \]
\[ {\rm Tr}\, 1 = 2^{d/2} \; . \tag{E.29} \]

Since the trace appears in all amplitudes, it constitutes an overall factor. As such, it can be
chosen not to leave a residue by including its effects with the ultraviolet divergence (along
with the $\gamma_E$ and $\log 4\pi$ in the $\overline{\rm MS}$ scheme, for example). For this reason, people usually just
take ${\rm Tr}\, 1 = 4$.

In addition to the above changes, we must somehow make a connection between the
$d$-dimensional Euclidean space our integrals are defined in and the $d$-dimensional Minkowski
space in which our theory is defined. The simple solution to this is to rotate our energy
integration axis 90° in its complex plane. This has the effect

\[ k^0 \rightarrow i k^0_E \tag{E.30} \]
\[ k^2 \rightarrow -k_E^2 \; . \tag{E.31} \]

Since we have constructed our propagators to have a well-defined pole structure, this rotation
can be justified for our Feynman integrals by Cauchy's theorem.¹⁰ All we must do is keep
track of the sign and the factor of $i$. At the end of the calculation, we simply rotate back to
obtain amplitudes with physical momenta.

9 This does cause a slight problem with our derivation of (E.19) in the case $\phi_f = -d$. The result of this is
that the integral

\[ \int \frac{d^d k}{(k^2)^{d/2}} \tag{E.26} \]

is not uniquely defined. We cannot take its integer-dimension value since this is also not well-defined, yet we
cannot prove that it is zero using the scaling argument. Even (E.24) is poorly defined in this case. However,
since it is not determined, we are free to define it as zero.

10 We can perform this shift provided we don't cross any poles in the process. One can check explicitly that
this leads to no problems here, unless one quantizes the theory in axial gauge with a pathological prescription
for the spurious singularities. See Section 1.6.

Unfortunately, not all of the sectors of our theory are so readily expressed in $d$ dimensions.
One can trace all of the difficulties to the Levi-Civita tensor, $\epsilon^{\mu\nu\alpha\beta}$. Looking back
at its definition (B.19), we see that this object appears because of the relation between the
determinant of $\Lambda$ and its components. A similar object can be defined in $n$ dimensions as a
totally antisymmetric $n$-index tensor. Its invariance is based on its equal treatment of all of
the dimensions of our space. We have no analogue of this in $d$ dimensions. Mathematicians
tell us that if we really want to take a non-integer-dimensional space seriously, its vectors will
have an infinite number of components,¹¹ an infinite number of which vanish as $d$ approaches
a positive integer. Since we do not intend to introduce tensors with an infinite number of
indices into our calculations (and have no idea how we would if we did!), we cannot
determine a definition for $\epsilon$ in our dimensionally regularized theory. As before, this means
that we are free to define one however we wish, provided that it is consistent.¹² Our studies
of renormalization in Section 1.4 imply that we can choose whatever regularization scheme we
want, as long as we stick with our scheme once it is chosen. The most obvious choice is simply
to keep the four-dimensional definition. This carries the price of breaking the full rotational
symmetry of the $d$-dimensional theory, but it is difficult to imagine a definition that will not.

When products of two $\epsilon$ tensors appear, we can pretend not to break the symmetry by
eliminating the $\epsilon$'s via

\[ \epsilon^{\mu\nu\alpha\beta}\, \epsilon_{\lambda\rho\sigma\tau} = -\delta^\mu_\lambda\, \delta^\nu_\rho\, \delta^\alpha_\sigma\, \delta^\beta_\tau + \text{permutations} \; . \tag{E.32} \]

Once we have everything in terms of $\delta$'s, (E.25) makes us feel like we are truly working in $d$
dimensions. However, the reduction identity (E.32) is derived in four dimensions by working
out the implications of the statement, "If the product of two epsilons is not zero, then each
index on one must have a twin on the other." This statement is obviously not generalizable
to $d$ dimensions. On the other hand, one can use (E.32) to define an $\epsilon$ scheme. This option
is described extensively in [99]. Yet another scheme which considers $\epsilon$ a totally antisymmetric
tensor with four indices in the full $d$-dimensional space has been brought forth in connection
with a non-associative Dirac-Clifford algebra [100].

11 Some discussion can be found in [40].

12 It also should not do irreparable damage to Lorentz invariance.



In the fermion sector, the Levi-Civita symbol's ambassador, $\gamma_5$, poses similar difficulties.
Because of the role $\gamma_5$ plays in fermion helicity projection operators and the prominence of
fermions in real world applications, different choices for $\gamma_5$'s extension have been extensively
studied in the literature. The schemes most often seen are Bardeen's anticommuting $\gamma_5$ [101]
and 't Hooft and Veltman's $\gamma_5$ prescription [10]. The first of these defines $\gamma_5$ through the
relation

\[ \{\gamma^\mu, \gamma_5\} = 0 \; , \tag{E.33} \]

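which is taken to be valid for all $\mu$. Although this is a natural extension, it is algebraically
inconsistent with a nonzero trace. Beginning with

\begin{align}
d\,{\rm Tr}\,\gamma_5 &= {\rm Tr}\,\gamma^\lambda \gamma_\lambda \gamma_5 \notag \\
&= -{\rm Tr}\,\gamma^\lambda \gamma_5 \gamma_\lambda \tag{E.34} \\
&= -d\,{\rm Tr}\,\gamma_5 \; , \tag{E.35}
\end{align}

one can show that

\[ d(d-2)(d-4)\cdots(d-2n)\; {\rm Tr}\,\gamma_5 \gamma^{\mu_1} \gamma^{\mu_2} \cdots \gamma^{\mu_{2n}} = 0 \tag{E.36} \]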

for any integer $n$. This is expected from the relation $\gamma_5$ has with $\epsilon$; the purpose of $\gamma_5$ is to
force the presence of all directions. However, a $\gamma_5$ which does not give finite traces is useless
to us. It has been suggested that dropping some other property of the spinor algebra may lead
to a consistent scheme. Examples are the algebra's associativity [100] and the trace's cyclicity
[102].
The 't Hooft and Veltman scheme is analogous to the scheme introduced above for the
Levi-Civita symbol. One simply clings to the four-dimensional definition of $\gamma_5$,

\[ \gamma_5 \equiv i\gamma^0 \gamma^1 \gamma^2 \gamma^3 \; , \tag{E.37} \]

in $d$ dimensions. This scheme has the advantage of being fully consistent. However, it explicitly
breaks the full symmetry of our $d$-dimensional spacetime. In particular, its commutation
relations with the $\gamma^\mu$ read

\[ \{\gamma^\mu, \gamma_5\} = 0 \; ; \qquad \mu \in \{0,1,2,3\} \tag{E.38} \]
\[ [\gamma^\mu, \gamma_5] = 0 \; ; \qquad \mu \notin \{0,1,2,3\} \; . \tag{E.39} \]

This reckless destruction is not without its price. In order to ensure that the nonsinglet axial 
current is conserved, one must perform an extra finite renormalization. However, once this is 
done, we are guaranteed a consistent result. 
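The four-dimensional statements used in this discussion can be checked directly with explicit matrices. The Python sketch below builds the $\gamma$-matrices in the Dirac representation with metric signature $(+,-,-,-)$ and takes $\gamma_5 = i\gamma^0\gamma^1\gamma^2\gamma^3$; the representation, the signature, the factor of $i$ and all helper names are assumptions made for illustration. With them one can confirm the Clifford algebra (E.27), the $d = 4$ values of (E.28) and (E.29), the anticommutation of $\gamma_5$ with the four $\gamma^\mu$, and that ${\rm Tr}\,\gamma_5\gamma^0\gamma^1\gamma^2\gamma^3$ is nonzero. For $n = 2$, (E.36) would force this trace to vanish for generic $d$, which is the inconsistency of the fully anticommuting scheme.

```python
def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]

def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(4)] for i in range(4)]

def scale(c, a):
    return [[c * a[i][j] for j in range(4)] for i in range(4)]

def trace(a):
    return sum(a[i][i] for i in range(4))

def neg2(m):
    return [[-x for x in row] for row in m]

def block(tl, tr, bl, br):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return [tl[0] + tr[0], tl[1] + tr[1], bl[0] + br[0], bl[1] + br[1]]

ZERO2 = [[0, 0], [0, 0]]
ONE2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices

IDENTITY = block(ONE2, ZERO2, ZERO2, ONE2)
ZERO4 = scale(0, IDENTITY)
METRIC = [1, -1, -1, -1]  # diagonal of g^{mu nu}; signature is an assumption

# Dirac representation (assumed): gamma^0 = diag(1, 1, -1, -1),
# gamma^i = [[0, sigma_i], [-sigma_i, 0]]
GAMMA = [block(ONE2, ZERO2, ZERO2, neg2(ONE2))]
GAMMA += [block(ZERO2, s, neg2(s), ZERO2) for s in SIGMA]

# (E.37) with the assumed factor of i: gamma_5 = i gamma^0 gamma^1 gamma^2 gamma^3
PRODUCT = matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))
GAMMA5 = scale(1j, PRODUCT)

def anticommutator(a, b):
    return add(matmul(a, b), matmul(b, a))
```

With these conventions `trace(matmul(GAMMA5, PRODUCT))` evaluates to $-4i$, while `trace(GAMMA5)` is zero.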

In some sense, all of this discussion of $\gamma_5$ is academic for application to QCD. Since $\gamma_5$ does
not appear in the QCD lagrangian, there is no real reason to introduce it. In a vector theory,
$\gamma_5$ functions mainly as a projector to help simplify calculations. On the other hand, a theory
like the Standard Model whose gauge bosons couple to a partially axial current requires a truly
consistent way to separate the positive- and negative-helicity components of the Dirac field.
Here, a scheme like that of 't Hooft and Veltman may do irreparable damage to the gauge
theory and lead to nonrenormalizability. All of the standard proofs that the Standard Model
is renormalizable assume an ultraviolet regulator that does not break the gauge symmetry.
However, here it is certainly not clear that dimensional regularization offers such a regulator.
Studies of the Ward identities in such theories can be found in [103].

Aside from the difficulties associated with parity-odd structures, dimensional regularization
has proven itself an extremely useful tool. The simplicity of its integrals, coupled with its respect
for symmetry, allows one to trust the correctness of one's calculation. One other major advantage
of this prescription is the fact that it can be used to regulate infrared as well as ultraviolet
divergences. Historically, one has had to keep infrared divergences at bay by introducing
offshellness or gauge boson masses. Since the former does not allow the calculation of onshell
physical matrix elements while the latter does violence to non-abelian gauge symmetries, these
options are clearly not acceptable. Dimensional regularization allows one to calculate onshell
matrix elements without breaking the gauge symmetry through gauge masses.

However, one cannot simply regulate both infrared and ultraviolet divergences simultaneously.
This is obvious in a massless theory, where all corrections would give exactly zero due
to lack of scale. The correct procedure begins with a calculation of offshell Green's functions.
These will have ultraviolet divergences which are to be regulated dimensionally by taking
d < 4. Through renormalization, we can re-express our amplitude in terms of renormalized
quantities which are ultraviolet finite. At this point, dimensional regularization is no longer
needed to regularize ultraviolet physics, so we are free to analytically continue our amplitude
to d > 4 where it is well-suited to handle infrared physics. It is now safe to take the onshell
limit without worrying about unregularized divergences.

A list of useful formulae for d-dimensional integration, along with some helpful properties
of the $\Gamma$-function, is given in Section A.3.






Bibliography 



[1] The original references are 

A. Einstein and M. Grossmann, Z. Phys. 62(1913)225
and 

A. Einstein, Ann. Phys., 4, 49(1916)769. 

For a standard modern treatment, see for example 

C. W. Misner, K. S. Thorne and J. A. Wheeler, Gravitation, Freeman and Co., New York, 
1973. 

[2] S. L. Glashow, Nucl. Phys. 22(1961)579. 
[3] S. Weinberg, Phys. Rev. Lett. 19(1967)1264. 

[4] A. Salam in Elementary Particle Theory, ed. N. Svartholm, Almqvist and Wiksell,
Stockholm, 1968.

[5] For a review of recent precision QED experiments and calculations, see 

T. Kinoshita, ed., Quantum Electrodynamics, World Scientific, Singapore, 1990 
and 

T. Kinoshita in History of Original Ideas and Basic Discoveries in Particle Physics, H. 
Newman and T. Ypsilantis, eds., Plenum Press, New York, 1995. 

[6] H. Dehmelt, P. Schwinberg, and R. VanDyck, Jr., Phys. Rev. Lett. 59(1987)26.

[7] H. Becquerel, Comptes Rendus 122(1896)420; 122(1896)501. 

[8] M. S. Curie, Comptes Rendus 126(1898)1101;

P. and S. Curie and G. Bemont, Comptes Rendus 127(1898)1215. 






[9] A. Pais, Inward Bound, Clarendon Press, Oxford, 1986. 

[10] G. 't Hooft, Nucl. Phys. B35(1971)167; 

G. 't Hooft and M. Veltman, Nucl. Phys. B44(1972)189. 

[11] These bosons were discovered in the UA1 and UA2 experiments at the CERN $p\bar{p}$ collider
in 1982-3;

UA1 Collaboration, G. Arnison et al., Phys. Lett. B122(1983)103 and Phys. Lett.
B126(1983)476;

UA2 Collaboration, M. Banner et al., Phys. Lett. B122(1983)476 and P. Bagnaia et al., 
Phys. Lett. B129(1983)130. 

[12] H. Geiger and E. Marsden, Proc. Roy. Soc. A82(1909)495. 

[13] J. J. Thomson, Phil. Mag. 48(1899)547;
Kelvin, Phil. Mag. 6(1902)257;
J. J. Thomson, Phil. Mag. 6(1903)673;
J. J. Thomson, Proc. Cambr. Phil. Soc. 13(1904)49.

[14] E. Rutherford, Phil. Mag. 21(1911)669. 

[15] E. Rutherford, Phil. Mag. 37(1919)581. 

[16] J. Chadwick, Nature 129(1932)312. 

[17] N. Bohr, Dansk. Vid. Selsk. Virksomh., June 1935-May 1936, p. 39; 
N. Bohr, Nature 138(1936)695. 

[18] See for example J. M. Blatt and V. Weisskopf, Theoretical Nuclear Physics, Wiley, New 
York, NY, 1952, Chap. 14. 

[19] H. Yukawa, Proc. Phys. Math. Soc. (Japan) 17(1935)48. 

[20] C.M.C. Lattes, H. Muirhead, G.P.S. Occhialini, and C.F. Powell, Nature 159(1947)694.

[21] M. Gell-Mann, Phys. Lett. 8(1964)214.

[22] G. Zweig, CERN Report No. 81821 TH401 (1964), unpublished. 






[23] W. Pauli, Z. Phys. 31(1925)765. 

[24] O. W. Greenberg, Phys. Rev. Lett. 13(1964)598; 

M. Y. Han and Y. Nambu, Phys. Rev. B139(1965)1006. 

[25] M. Gell-Mann, "The Symmetry Group of Vector and Axial Currents", Physics 1 (1964), 
pp.63-75. 

[26] E. D. Bloom, et al., Phys. Rev. Lett. 23(1969)930.

[27] R. P. Feynman, Phys. Rev. Lett. 23(1969)1415.

[28] C. G. Callan and D. J. Gross, Phys. Rev. Lett. 21(1968)311;
J. D. Bjorken, Phys. Rev. 179(1969)1547.

[29] K. G. Wilson, Phys. Rev. 179(1969)1499. 

[30] C. G. Callan and D. J. Gross, Phys. Rev. D8(1973)4383. 

[31] S. Coleman and D. J. Gross, Phys. Rev. Lett. 31(1973)851. 

[32] D. J. Gross and F. Wilczek, Phys. Rev. Lett. 30(1973)1343.

[33] H. D. Politzer, Phys. Rev. Lett. 30(1973)3633.

[34] X. Ji, Phys. Rev. Lett. 78(1997)610. 

[35] X. Ji and J. Osborne, Phys. Rev. D58(1998)094018. 

[36] A. Ali, V. M. Braun, and G. Hiller, Phys. Lett. B266(1991)117. 

[37] X. Ji and J. Osborne, Eur. Phys. J. C9(1999)487. 

[38] I have found 

M. E. Peskin and D. V. Schroeder, An Introduction to Quantum Field Theory, Addison- 
Wesley, Reading, MA, 1995 

extremely helpful for a general discussion. More advanced topics can be found in
R. J. Rivers, Path Integral Methods in Quantum Field Theory, Cambridge University,
Cambridge, 1987.






[39] The original papers on the 'proper' prescription are 
S. Mandelstam, Nucl. Phys. B213(1983)149; 
G. Leibbrandt, Phys. Rev. D29(1984)1699.
A standard reference on this material is 

A. Bassetto, G. Nardelli, and R. Soldati, Yang-Mills Theories in Algebraic Noncovariant 
Gauges : Quantization and Renormalization, World Scientific, Singapore (1991). 

[40] J. Collins, Renormalization, Cambridge University, Cambridge, 1984. 

[41] G. 't Hooft, Nucl. Phys. B33(1971)173; 

B. W. Lee and J. Zinn-Justin, Phys. Rev. D5(1972)3121,3137;
A. A. Slavnov, Theor. Math. Phys. 10(1972)99;

J. C. Taylor, Nucl. Phys. B33(1971)436; 

C. Becchi, A. Rouet and R. Stora, Ann. Phys. 98(1976)287;
M. Z. Iofa and I. V. Tyutin, Theor. Math. Phys. 27(1976)316. 

[42] M. E. Peskin and D. V. Schroeder, An Introduction to Quantum Field Theory, Addison- 
Wesley, Reading, MA, 1995, Chapter 12. 

[43] See for example 

R. K. Ellis, W. J. Stirling and B. R. Webber, QCD and Collider Physics, Cambridge
University Press, Cambridge, UK, 1996.

[44] A. Bassetto, M. Dalbosco, I. Lazzizzera and R. Soldati, Phys. Rev. D36(1987)3138; 
S. Mandelstam, Nucl. Phys. B213(1983)149;
G. Leibbrandt, Phys. Rev. D29(1984)1699. 

[45] G. Curci, W. Furmanski and R. Petronzio, Nucl. Phys. B175(1980)27. 

[46] W. Furmanski and R. Petronzio, Phys. Lett. B97(1980)437; 
R. K. Ellis and W. Vogelsang, hep-ph/9602356;
W. Vogelsang, Nucl. Phys. B475(1996)47. 

[47] G. Sterman, Phys. Rev. D17(1978)1789, 2773. 

[48] S. Libby and G. Sterman, Phys. Rev. D18(1978)3252, 4737. 






[49] J. C. Collins and G. Sterman, Nucl. Phys. B185(1981)172.

[50] L.D. Landau, Nucl. Phys. 13(1959)181. 

[51] H. Lehmann, K. Symanzik, and W. Zimmermann, Nuovo Cim. 1(1955)205. 

[52] J. C. Collins and D. E. Soper, Nucl. Phys. B194(1982)445.

[53] J. S. Bell and R. Jackiw, Nuovo Cimento 60A(1969)47;
S. L. Adler, Phys. Rev. 177(1969)2426.

[54] V. N. Gribov and L. N. Lipatov, Sov. J. Nucl. Phys. 15(1972)438; 
L. N. Lipatov, Sov. J. Nucl. Phys. 20(1975)95; 
G. Altarelli and G. Parisi, Nucl. Phys. B126(1977)298; 
Yu. L. Dokshitzer, Sov. Phys. JETP 46(1977)641. 

[55] J. C. Collins, D. E. Soper and G. Sterman, in Perturbative QCD, ed. A. H. Mueller, World
Scientific, Singapore, 1989. 

[56] J. C. Collins, L. Frankfurt, and M. Strikman, Phys. Rev. D56(1997)2982. 

[57] R. K. Ellis, W. Furmanski, R. Petronzio, Nucl. Phys. B207(1982)1; B212(1983)29;
J. Qiu, Phys. Rev. D42(1990)30. 

[58] S. Coleman and R.E. Norton, Nuovo Cim. 28(1965)438. 

[59] T. Kinoshita, J. Math. Phys. 3(1962)650; 

T. D. Lee and M. Nauenberg, Phys. Rev. B133(1964)1549. 

[60] G. Sterman, An Introduction to Quantum Field Theory, Cambridge University Press,
Cambridge, 1993.

[61] W. A. Bardeen, A. J. Buras, D. W. Duke and T. Muta, Phys. Rev. D18 (1978)3998. 
[62] R. Mertig and W. L. van Neerven, Z. Phys. C70(1996)637.

[63] C. Itzykson and J. B. Zuber, Quantum Field Theory, McGraw-Hill Inc., New York, NY, 
1980. 






[64] R. L. Jaffe and X. Ji, Phys. Rev. Lett. 67(1991)552; 
X. Ji, Nucl. Phys. B402(1993)217. 

[65] D. Gross and F. Wilczek, Phys. Rev. D9(1974)980. 

[66] X. Ji, J. Tang and P. Hoodbhoy, Phys. Rev. Lett. 76(1996)740. 

[67] A. V. Radyushkin, Phys. Lett. B380(1996)417; Phys. Lett. B385(1996)333; Phys. Rev. 
D56(1997)5524. 

[68] X. Ji, Phys. Rev. D55(1997)7114. 

[69] J. Bartels and M. Loewe, Z. Phys. C12(1982)263; 
B. Geyer et al., Z. Phys. C26(1985)591; 
T. Braunschweig et al., Z. Phys. C33(1986)275;
F.-M. Dittes et al., Phys. Lett. B209(1988)325; 
I. I. Balitsky and V. M. Braun, Nucl. Phys. B311(1989)541; 

P. Jain and J. P. Ralston, in Future Directions in Particle and Nuclear Physics at
Multi-GeV Hadron Beam Facilities, BNL, March, 1993;

I. I. Balitsky and A. V. Radyushkin, Phys. Lett. B413(1997)114; 

J. Blumlein, B. Geyer, and D. Robaschik, Phys. Lett. B406(1997)161;

X. Ji, W. Melnitchouk, X. Song, Phys. Rev. D56(1997)5511; 

L. L. Frankfurt, A. Freund, V. Guzey, and M. Strikman, hep-ph/9703449;
V. Yu. Petrov et al., hep-ph/9710270;
A. V. Belitsky, B. Geyer, D. Muller and A. Schafer, hep-ph/9710427;
L. Mankiewicz, G. Piller, E. Stein, M. Vanttinen, T. Weigl, hep-ph/9712251.

[70] Z. Chen, Columbia preprint CU-TP-835, hep-ph/9705279.



[71] K. Watanabe, Prog. Th. Phys. 67(1982)1834. 

[72] D. Muller, D. Robaschik, B. Geyer, F.M. Dittes, and J. Horejsi, Fortschr. Phys. 
42(1994)101. 

[73] D. Muller, Phys. Rev. D58(1998)054005; 

A. V. Belitsky and D. Muller, Phys. Lett. B417(1998)129. 






[74] L. Mankiewicz, G. Piller, E. Stein, M. Vanttinen, T. Weigl, TUM-T39-97-31, e-Print
Archive: hep-ph/9712251.



[75] X. Ji and J. Osborne, Phys. Rev. D57(1998)1337. 
[76] P. Hoodbhoy and X. Ji, Phys. Rev. D58(1998)054006. 

[77] G. Altarelli, R. D. Ball, S. Forte and G. Ridolfi, Nucl. Phys. B496(1997)337. 

[78] X. Ji, J. Phys. G24(1998)1.

[79] S. A. Larin, Phys. Lett. B303(1993)113. 

[80] The original references are 

P. Carruthers, Phys. Rev. D2(1970)2265; 
R. J. Crewther, Phys. Rev. D3(1971)3152. 
For a general discussion, see 

M. E. Peskin and D. V. Schroeder, An Introduction to Quantum Field Theory, Addison-
Wesley, Reading, MA, 1995, Chapter 19. 

[81] M. Stratmann, Z. Phys. C60(1993)763; K. Abe et al., Phys. Rev. D58(1998)112003. 

[82] I. I. Balitsky, V. M. Braun, Y. Koike, and T. Tanaka, Phys. Rev. Lett. 77(1996)3078; 
A. V. Belitsky, Phys. Lett. B405(1997)312. 

[83] J. Kogut and D. Soper, Phys. Rev. D1(1970)2901;
E. Tomboulis, Phys. Rev. D8(1973)2736. 

[84] E. V. Shuryak and A. I. Vainshtein, Nucl. Phys. B199(1982)951. 

[85] A. P. Bukhvostov, E. A. Kuraev, and L. N. Lipatov, JETP Lett. 37(1983)482; 
A. P. Bukhvostov, E. A. Kuraev, and L. N. Lipatov, Sov. Phys. JETP 60(1984)22; 
P. G. Ratcliffe, Nucl. Phys. B264(1986)493; 
I. I. Balitsky and V. M. Braun, Nucl. Phys. B311(1988/89)541; 
X. Ji and C. Chou, Phys. Rev. D42(1990)3637; 
J. Kodaira et al., Phys. Lett. B387(1996)855. 






[86] R. L. Jaffe and X. Ji, Phys. Rev. Lett. 67(1991)552; 

R. L. Jaffe and X. Ji, Nucl. Phys. B375(1992)527.

[87] X. Ji, Nucl. Phys. B402(1993)217. 

[88] S. Weinberg, The Quantum Theory of Fields Vol. I, Cambridge University, Cambridge,
1995.

[89] E. P. Wigner, Ann. Math. 40(1939)149.

[90] M. Hamermesh, Group Theory and Its Application to Physical Problems, Addison-Wesley
Publishing Co., Reading, MA, 1962.

[91] H. M. Chan and S. T. Tsou, Some Elementary Gauge Theory Concepts, World Scientific,
Singapore (1993).

[92] G. 't Hooft, Nucl. Phys. B72(1974)461;
E. Witten, Nucl. Phys. B160(1979)57;
A. V. Manohar, hep-ph/9802419.

[93] R. Dashen and A.V. Manohar, Phys. Lett. B315(1993)425; B315(1993)438.

[94] T. D. Cohen and C. K. Chow, hep-ph/0003131.

[95] O. Klein, Z. Phys. 37(1926)895;
W. Gordon, Z. Phys. 40(1926)117.

[96] P. A. M. Dirac, Proc. Roy. Soc. 133(1931)60.

[97] C. D. Anderson, Science 76(1932)238; Phys. Rev. 43(1933)491.

[98] K. Fujikawa, Phys. Rev. Lett. 42(1979)1195;
K. Fujikawa, Phys. Rev. D21(1980)2848.

[99] P. Breitenlohner and D. Maison, Commun. Math. Phys. 52(1977)11. 

[100] S. Okubo, Prog. Theor. Phys. Supp. 86(1986)287. 

[101] W. A. Bardeen, R. Gastmans and B. Lautrup, Nucl. Phys. B46(1972)319. 






[102] J. G. Korner, D. Kreimer and K. Schilcher, Z. Phys. C54(1992)503.

[103] B. A. Ovrut, Nucl. Phys. B213(1983)241; R. Ferrari, A. Le Yaouanc, L. Oliver and J. C.
Raynal, Phys. Rev. D52(1995)3036.






